Abstract

We consider the stochastic analysis of information ranking algorithms of large interconnected data sets, e.g. Google's PageRank algorithm for ranking pages on the World Wide Web. The stochastic formulation of the problem results in an equation of the form

$$R \stackrel{D}{=} Q + \sum_{i=1}^{N} C_i R_i,$$

where $N, Q, \{R_i\}_{i \ge 1}, \{C, C_i\}_{i \ge 1}$ are independent non-negative random variables, $\{C, C_i\}_{i \ge 1}$ are identically distributed, and $\{R_i\}_{i \ge 1}$ are independent copies of $R$; $\stackrel{D}{=}$ stands for equality in distribution. We study the asymptotic properties of the distribution of $R$ that, in the context of PageRank, represents the frequencies of highly ranked pages. The preceding equation is interesting in its own right since it belongs to a more general class of weighted branching processes that have been found useful in the analysis of many other algorithms. Our first main result shows that if $E[N]E[C^\alpha] = 1$, $\alpha > 0$, and $Q, N$ satisfy additional moment conditions, then $R$ has a power law distribution of index $\alpha$. This result is obtained using a new approach based on an extension of Goldie's (1991) implicit renewal theorem. Furthermore, when $N$ is regularly varying of index $\alpha > 1$, $E[N]E[C^\alpha] < 1$ and $Q, C$ have higher moments than $\alpha$, then the distributions of $R$ and $N$ are tail equivalent. The latter result is derived via a novel sample path large deviation method for recursive random sums. Similarly, we characterize the situation when the distribution of $R$ is determined by the tail of $Q$. The preceding approaches may be of independent interest, as they can be used for analyzing other functionals on trees. We also briefly discuss the engineering implications of our results.
1. Introduction

We consider a problem of ranking large interconnected information (data) sets, e.g., ranking pages on the World Wide Web (Web). A solution to the preceding problem is given by Google's PageRank algorithm, the details of which are presented in Section 1.1. Given the large scale of these information sets, we adopt a stochastic approach to the page ranking problem. The stochastic formulation naturally results in an equation of the form

$$R \stackrel{D}{=} Q + \sum_{i=1}^{N} C_i R_i, \qquad (1.1)$$

where $N, Q, \{R_i\}_{i \ge 1}, \{C, C_i\}_{i \ge 1}$ are independent non-negative random variables, $P(Q > 0) > 0$, $\{C, C_i\}_{i \ge 1}$ are identically distributed, and $\{R_i\}_{i \ge 1}$ are independent copies of $R$; $\stackrel{D}{=}$ stands for equality in distribution. We study the asymptotic properties of the distribution of $R$ that, in the context of PageRank, represents the frequencies of highly ranked pages. In somewhat smaller generality, the preceding stochastic setup was first introduced and analyzed in [35] for the PageRank algorithm; the formulation given in (1.1) was later studied in [34].

The canonical representation given by recursion (1.1) is also of independent interest since it belongs to a more general class of weighted branching processes (WBPs) [30, 25]; the connection to WBPs is discussed in more detail in Section 2.3. With a small abuse of notation, we also refer to our more restrictive processes as WBPs. These processes have been found useful in the average-case analysis of many algorithms [31], e.g. the quicksort algorithm [15], and thus our study of recursion (1.1) may be useful in these types of applications. Furthermore, when $Q \equiv 1$, $C_i \equiv 1$, the steady state solution to (1.1) represents the total number of individuals born in an ordinary branching process. Also, by letting $N$ be a Poisson random variable and fixing $Q \equiv 1$, $C_i \equiv 1$, equation (1.1) reduces to the recursion that is satisfied by the busy period of an M/G/1 queue. Similarly, selecting $N \equiv 1$ yields the fixed point equation satisfied by the first order autoregressive process; see Section 2.3 for a more thorough discussion on related processes.

In Section 2 we connect the iterations of recursion (1.1) to an explicit construction of a WBP on a tree, such that the sum of all the weights of the first $n$ generations of the tree is directly related to the $n$th iteration of the recursion. Then, in Section 3 we present explicit estimates for the moments of the total weight, $W_n$, of the $n$th generation in the corresponding WBP. Using these moment estimates and the WBP representation, we show in Section 3.1 that under mild conditions the iterations of (1.1) converge in distribution to a unique and finite steady state random variable $R$. Hence, under the stated assumptions, this limiting distribution $P(R \le x)$ is the unique solution to (1.1). The steady state variable $R$ represents the sum of all the weights in the corresponding branching tree.

Studying the asymptotic tail properties of the constructed steady state solution $R$ to (1.1) represents the main focus of this paper. In particular, we study the possible causes that can result in power tail asymptotics for $P(R > x)$. We discover that the tail behavior of $R$ can be determined/dominated by the statistical properties of any of the three variables $C$, $N$ and $Q$. The corresponding results are presented in Sections 4, 5 and 6, respectively. Our emphasis on power law asymptotics is motivated by the well established empirical fact that the number of pages that point to a specific page (in-degree) on the Web, represented by $N$ in recursion (1.1), follows a power law distribution; other complex data sets, e.g. citations, are found to possess similar power law properties as well.

Our first main result on the tail behavior of $P(R > x)$ is presented in Theorem 4.2, showing that if $E[N]E[C^\alpha] = 1$, $\alpha > 0$ and $Q, N$ satisfy additional moment conditions, then $R$ has a power law distribution of index $\alpha$, with an explicitly characterized constant of proportionality. In particular, when $\alpha$ is an integer, the constant of proportionality of the power law distribution is explicitly computable, see Corollary 4.1. This result is obtained by an extension of Goldie's (1991) implicit renewal theorem that we present in Theorem 4.1. This extension may be of independent interest since $R$ and $C$ in the statement of Theorem 4.1 can be any two independent random variables that may satisfy a different recursion. In the context of the broader literature on WBPs, our results are related to the studies in [30] (see Theorem 6), and more recently in [2], both of which study recursion (1.1) using stable law methods when $Q, \{C_i\}$ are deterministic constants. However, these deterministic assumptions fall outside of the scope of this paper; for more details see the discussion in Section 2.3 and the remarks after Theorem 4.2. Outside of these results, the majority of the work on WBPs considers the homogeneous equation ($Q \equiv 0$), e.g. in [55] the behavior of the distribution of $R$ was characterized using stable-law distributions for $0 < \alpha < 1$. Also, related results for the homogeneous case ($Q \equiv 0$) and $\alpha > 1$ can be found in Theorem 2.2 of [27] and Proposition 7 of [18]. Interestingly, our approach for the nonhomogeneous case ($P(Q > 0) > 0$) shows that the distribution of $R$ can have a uniform treatment for any $\alpha > 0$. For additional comments on results related to our Theorem 4.2 see the remarks following its statement. Furthermore, this result may provide a new explanation of why power laws are so commonly found in the distribution of wealth since weighted branching processes appear to be reasonable models for the total wealth of a family tree.

Section 5 studies the case when $N$ is power law and dominates the tail behavior of $R$. This is the case that more closely relates to the original formulation of PageRank and the structure of the Web graph since the in-degree $N$ is well accepted to be a power law. Our main result in this case, stated in Theorem 5.1, shows that, when $N$ is regularly varying of index $\alpha > 1$, $E[N]E[C^\alpha] < 1$ and $Q, C$ have higher moments than $\alpha$, then the distribution of $R$ is tail equivalent to that of $N$. Our approach in deriving this result is based on a new sample path heavy-tailed large deviation method for weighted recursions on trees. The key technical result is given by Proposition 5.1, which provides a uniform bound (in $n$ and $x$) on the distribution of the total weight of the $n$th generation, $P(W_n > x)$. We would also like to point out that Proposition 5.1 resembles to some extent a classical result by Kesten (see Lemma 7 on p. 149 of [5]), which provides a uniform bound for the sum of heavy-tailed (subexponential) random variables. The main difference between the latter result and our uniform bound is that $n$ refers to the depth of the recursion in our case, while in Lemma 7 of [5], $n$ is the number of terms in the sum. This makes the derivation of Proposition 5.1 considerably more complicated, and perhaps implausible, if it were not for the fact that we restrict our attention to regularly varying distributions, as opposed to the general subexponential class.

Section 6 investigates a third possible source of heavy tails for $R$, the one that arises from the innovation, $Q$, being power law; see Theorem 6.1. For $N \equiv 1$, this result is consistent with a corresponding result for the first order autoregressive process in Lemma A.3 of [28]. The proofs of more technical results are postponed to Section 7.

Finally, from a mathematical perspective, we would like to emphasize that our sample path large deviation approach, as well as the extension of the implicit renewal theory, provide a new set of tools that can be of potential use in other applications, as well as in studying the broader class of recursions on trees, e.g., one can readily characterize the asymptotic behavior of the distribution that solves $R \stackrel{D}{=} Q + \max_{1 \le i \le N} C_i R_i$. Furthermore, from an engineering perspective, our Theorem 5.1 shows that for highly ranked pages, the PageRank algorithm basically reflects the popularity vote given by the number of references $N$, implying that overly inflated referencing may be advantageous. A more detailed discussion on the engineering implications of the performance and design of ranking algorithms, e.g. PageRank, can be found at the end of Section 5.

1.1. Google's algorithm: PageRank 

PageRank is an algorithm trademarked by Google, the Web search engine, to assign to each page a numerical weight that measures its relative importance with respect to other pages. We think of the Web as a very large interconnected graph where nodes correspond to pages. The Google trademarked algorithm PageRank defines the page rank as:

$$PR(p_i) = \frac{1-d}{n} + d \sum_{p_j \in M(p_i)} \frac{PR(p_j)}{L(p_j)}, \qquad (1.2)$$

where, using Google's notation, $p_1, p_2, \dots, p_n$ are the pages under consideration, $M(p_i)$ is the set of pages that link to $p_i$, $L(p_j)$ is the number of outbound links on page $p_j$, $n$ is the total number of pages on the Web, and $d$ is a damping factor, usually $d = 0.85$. As noted in the original paper by Brin and Page (1998) [11], PageRank "can be calculated using a simple iterative algorithm, and corresponds to the principal eigenvector of the normalized link matrix of the Web. Also, a PageRank for 26 million web pages can be computed in a few hours on a medium size workstation." Other link-based ranking algorithms for web pages include the HITS algorithm, developed by Kleinberg [23], and the TrustRank algorithm [17].

While in principle the solution to (1.2) reduces to the solution of a large system (possibly billions) of linear equations, we believe that finding page ranks in such a way is unlikely to be insightful. Specifically, if one obtains the principal eigenvector of the normalized link matrix, it is hard to obtain from the solution qualitative insights about the relationship between highly ranked pages and the in-degree/out-degree statistical properties of the graph.
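
To make the "simple iterative algorithm" quoted above concrete, the following is a minimal sketch of fixed-point iteration for the PageRank definition (the rank of a page is $(1-d)/n$ plus $d$ times the sum of $PR(p_j)/L(p_j)$ over pages $p_j$ linking to it). The function name `pagerank`, the three-page graph `toy_web`, and the assumption that every page has at least one outbound link are illustrative choices, not taken from the paper.

```python
def pagerank(links, d=0.85, tol=1e-12, max_iter=1000):
    """Iterate the PageRank definition to its fixed point.

    links maps each page to the list of pages it links to.
    Assumption: every page has at least one outbound link.
    """
    pages = sorted(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}          # uniform starting point
    for _ in range(max_iter):
        new = {p: (1.0 - d) / n for p in pages}  # the (1 - d)/n term
        for pj, outs in links.items():
            share = d * rank[pj] / len(outs)     # d * PR(p_j) / L(p_j)
            for pi in outs:
                new[pi] += share
        delta = sum(abs(new[p] - rank[p]) for p in pages)
        rank = new
        if delta < tol:
            break
    return rank


# Hypothetical three-page Web: a -> b, a -> c, b -> c, c -> a.
toy_web = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(toy_web)
```

In this toy graph page `c` has in-degree two and receives the highest rank, while `b`, which is referenced only by a page that splits its vote, receives the lowest.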

In particular, the division by the out-degree, $L(p_j)$, in equation (1.2) was meant to decrease the contribution of pages with highly inflated referencing, i.e., those pages that basically point/reference possibly indiscriminately to other documents. However, the stochastic approach (to be described in the following sections) reveals that highly ranked pages are essentially insensitive to the parameters of the out-degree distribution, implying that the PageRank algorithm may not reduce the effects of overly inflated referencing (citations, voting) as originally intended, i.e., it may lead to possibly unjustifiable highly ranked pages. An analytical explanation as to why the tail of the rank distribution is dominated by $N$ was first given in [35] and [34]. More discussions on this topic are provided at the end of Section 5.

A stochastic approach to analyze (1.2) is to consider the recursion

$$R \stackrel{D}{=} \gamma + c \sum_{i=1}^{N} \frac{R_i}{D_i}, \qquad (1.3)$$

where $\gamma, c > 0$ are constants, $cE[1/D] < 1$, $N$ is a random variable independent of the $R_i$'s and $D_i$'s, the $D_i$'s are iid random variables satisfying $D_i \ge 1$, and the $R_i$'s are iid random variables having the same distribution as $R$. In terms of recursion (1.2), $R$ is the rank of a random page, $N$ corresponds to the in-degree of that node, the $R_i$'s are the ranks of the pages pointing to it, and the $D_i$'s correspond to the out-degrees of each of these pages. The experimental justification of these independence assumptions can be found in [33]. This stochastic setup was first introduced in [35], where the process resulting after a finite number of iterations of (1.3) was analyzed. More recently, in a follow up paper [34], the more general recursion

$$R \stackrel{D}{=} Q + \sum_{i=1}^{N} C_i R_i$$

was analyzed via Tauberian theorems for the cases when $N$ or $Q$ dominate. In [33], dependency between $N$ and $Q$ is allowed, but additional moment conditions are imposed. Recall that in the setup considered here $N, Q, \{R_i\}_{i \ge 1}, \{C, C_i\}_{i \ge 1}$ are independent non-negative random variables, $P(Q > 0) > 0$, $\{C, C_i\}_{i \ge 1}$ are identically distributed, and $\{R_i\}_{i \ge 1}$ are independent copies of $R$.

2. Model Description 

As outlined above, we study the sequence of random variables that are obtained by iterating (1.1). Specifically, we consider for $n \ge 0$

$$R^*_{n+1} = Q_n + \sum_{i=1}^{N_n} C_{n,i} R^*_{n,i}, \qquad (2.1)$$

where $\{R^*_{n,i}\}_{i \ge 1}$ are iid copies of $R^*_n$ from the previous iteration, and $\{N_n\}, \{C_{n,i}\}, \{Q_n\}$ are mutually independent iid sequences of random variables; for $n = 0$, $R^*_{0,i}$ are iid copies of the initial value $R^*_0$.

In this section we will discuss the weak convergence of $R^*_n$ to a finite random variable $R$, independently of the initial condition $R^*_0$. In other words, $R$ is the unique solution to (1.1) under the assumptions of Lemma 3.4. In particular, we will construct a process on a tree that converges a.s. to $R$. These convergence results may be of practical interest as well since ranking algorithms are implemented recursively. The actual proofs are postponed until Section 3.1.
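
As an illustration of the iteration just described, the sketch below draws samples of the $n$th iterate by recursing on independent copies of the previous iterate, and compares the sample mean with the value $\sum_{k<n} \rho^k E[Q] + \rho^n E[R^*_0]$, $\rho = E[N]E[C]$, obtained by taking expectations in the recursion. The distributions ($N$ uniform on $\{0,1,2\}$, $C$ uniform on $(0,1)$, $Q \equiv 1$), the constant initial value, and the name `sample_iterate` are assumptions made only for this example.

```python
import random


def sample_iterate(n, r0, rng):
    """Draw one sample of the n-th iterate, started from the constant r0."""
    if n == 0:
        return r0
    n_children = rng.choice([0, 1, 2])   # N uniform on {0, 1, 2}, E[N] = 1
    total = 1.0                          # Q = 1
    for _ in range(n_children):
        c = rng.random()                 # C uniform on (0, 1), E[C] = 1/2
        # each summand uses an independent copy of the previous iterate
        total += c * sample_iterate(n - 1, r0, rng)
    return total


rng = random.Random(2024)
n_iter, runs = 6, 20000
estimate = sum(sample_iterate(n_iter, 0.0, rng) for _ in range(runs)) / runs
# With rho = 1/2, E[Q] = 1 and r0 = 0 the mean is sum_{k < n} (1/2)^k.
exact = sum(0.5 ** k for k in range(n_iter))
```

The sample mean `estimate` should lie close to `exact`; the agreement is only statistical, so no particular digits are claimed here.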
2.1. Construction of R on a Tree

To better understand the dynamics of our recursion, we give below a sample path construction of the random variable $R$ on a tree. First we construct a random tree $T$. We use the notation $\emptyset$ to denote the root node of $T$, and $A_n$, $n \ge 0$, to denote the set of all individuals in the $n$th generation of $T$, $A_0 = \{\emptyset\}$. Let $Z_n$ be the number of individuals in the $n$th generation, that is, $Z_n = |A_n|$, where $|\cdot|$ denotes the cardinality of a set; in particular, $Z_0 = 1$. We iteratively construct the tree as follows. Let $N^{(0)}$ be the number of individuals born to the root node $\emptyset$ and let $\{N^{(n)}_{(i_1,\dots,i_n)}\}_{n \ge 1}$ be iid copies of $N$. Define now

$$A_1 = \{ i : 1 \le i \le N^{(0)} \}, \qquad A_n = \{ (i_1, i_2, \dots, i_n) : (i_1, \dots, i_{n-1}) \in A_{n-1},\ 1 \le i_n \le N^{(n-1)}_{(i_1,\dots,i_{n-1})} \},$$

and then the number of individuals $Z_n = |A_n|$ in the $n$th generation, $n \ge 1$, satisfies the following branching recursion

$$Z_n = \sum_{(i_1,\dots,i_{n-1}) \in A_{n-1}} N^{(n-1)}_{(i_1,\dots,i_{n-1})}.$$

Suppose now that individual $(i_1, \dots, i_n)$ in the tree has a weight $C^{(n)}_{(i_1,\dots,i_n)}$ defined via the recursion

$$C^{(n)}_{(i_1,\dots,i_n)} = C^{(n-1)}_{(i_1,\dots,i_{n-1})} \, C_{(i_1,\dots,i_{n-1}),\, i_n}, \qquad n \ge 1,$$

where $C^{(0)} = 1$ is the weight of the root node and the random variables $\{C_{(i_1,\dots,i_{n-1}),\, i_n} : n \ge 1,\ i_k \ge 1\}$ are iid with the same distribution as $C$. Note that $C^{(n)}_{(i_1,\dots,i_n)}$ is equal to the product of all the weights along the branch leading to node $(i_1, \dots, i_n)$, as depicted on the figure below. Define now the process

$$W_0 = Q^{(0)}, \qquad W_n = \sum_{(i_1,\dots,i_n) \in A_n} Q^{(n)}_{(i_1,\dots,i_n)} C^{(n)}_{(i_1,\dots,i_n)}, \quad n \ge 1,$$

where $A_n$ is the set of all individuals in the $n$th generation and $\{Q^{(n)}_{(i_1,\dots,i_n)}\}_{n \ge 0}$ is a sequence of iid random variables having the same distribution as $Q$ (see Figure 1), and independent of the weights $C^{(\cdot)}_{(\cdot)}$.




Figure 1: Construction on a tree 



Observe that when $C \equiv 1$ and $Q \equiv 1$, $W_n$ is equal to the number of individuals in the $n$th generation of the corresponding branching process, and in particular $Z_n \equiv W_n$. Otherwise, $W_n$ represents the sum of the weights of all the individuals in the $n$th generation. Related processes known as weighted branching processes have been considered in the existing literature [3UJ UHl HI] and are discussed in more detail in Section 2.3. With a small abuse of notation we also refer to our more restrictive processes as WBPs.
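
The tree construction can be sketched in a few lines: each generation is stored as the list of branch weights (the product of the weights along the branch leading to each individual), $Z_n$ is the length of that list, and $W_n$ is the sum of each branch weight times an independent copy of $Q$. The function name `simulate_generations` and the sampler arguments are hypothetical; the offspring law used in the demonstration (uniform on $\{0,1,2,3\}$) is likewise an assumption. With unit weights and unit $Q$ the sketch reproduces the observation above that $W_n$ coincides with $Z_n$.

```python
import random


def simulate_generations(sample_n, sample_c, sample_q, depth, rng):
    """Return ([Z_0..Z_depth], [W_0..W_depth]) for one realization of the tree."""
    weights = [1.0]                      # generation 0: the root, weight 1
    z, w = [], []
    for _generation in range(depth + 1):
        z.append(len(weights))           # Z_n = number of individuals
        # W_n = sum over the generation of (branch weight) * (copy of Q)
        w.append(sum(weight * sample_q(rng) for weight in weights))
        # each individual has N children; a child's branch weight is the
        # parent's branch weight times a fresh copy of C
        weights = [weight * sample_c(rng)
                   for weight in weights
                   for _child in range(sample_n(rng))]
    return z, w


rng = random.Random(7)
z, w = simulate_generations(
    sample_n=lambda r: r.choice([0, 1, 2, 3]),
    sample_c=lambda r: 1.0,              # C = 1
    sample_q=lambda r: 1.0,              # Q = 1
    depth=8,
    rng=rng,
)
```

Replacing the two constant samplers by random ones gives realizations of a genuinely weighted process, for which $W_n$ and $Z_n$ no longer agree.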

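Define the process $\{R^{(n)}\}_{n \ge 0}$ according to

$$R^{(n)} = \sum_{k=0}^{n} W_k, \qquad n \ge 0,$$

that is, $R^{(n)}$ is the sum of the weights of all the individuals on the tree up to the $n$th generation. Clearly, when $Q \equiv 1$ and $C \equiv 1$, $R^{(n)}$ is simply the number of individuals in a branching process up to the $n$th generation. We define the random variable $R$ according to

$$R \triangleq \lim_{n \to \infty} R^{(n)} = \sum_{k=0}^{\infty} W_k. \qquad (2.2)$$

Furthermore, it is not hard to see that $R^{(n)}$ satisfies the recursion

$$R^{(n)} = \sum_{j=1}^{N^{(0)}} C^{(1)}_j R^{(n-1)}_j + Q^{(0)}, \qquad (2.3)$$

for $n \ge 1$, where $\{R^{(n-1)}_j\}$ are independent copies of $R^{(n-1)}$ corresponding to the tree starting with individual $j$ in the first generation and ending on the $n$th generation; note that $R^{(0)}_j = Q^{(1)}_j$. Moreover, since the tree structure repeats itself after the first generation, $W_n$ satisfies

$$W_n = \sum_{(i_1,\dots,i_n) \in A_n} Q^{(n)}_{(i_1,\dots,i_n)} C^{(n)}_{(i_1,\dots,i_n)} = \sum_{k=1}^{N^{(0)}} C^{(1)}_k \sum_{(k,\dots,i_n) \in A_n} Q^{(n)}_{(k,\dots,i_n)} \tilde{C}^{(n)}_{(k,\dots,i_n)} = \sum_{k=1}^{N} C_k W_{n-1,k},$$

where $N, C_k, W_{n-1,k}$ are independent of each other and of all other random variables, $W_{n-1,k}$ has the same distribution as $W_{n-1}$, and $\tilde{C}^{(n)}_{(k,\dots,i_n)}$ denotes the product of the weights along the branch from individual $k$ to node $(k,\dots,i_n)$, i.e., if $C^{(1)}_k > 0$, then $\tilde{C}^{(n)}_{(k,\dots,i_n)} = C^{(n)}_{(k,\dots,i_n)} / C^{(1)}_k$.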

2.2. Connection between $R^*_n$ and $R^{(n)}$

We now connect the two processes $R^*_n$ and $R^{(n)}$, the one obtained by iterating (2.1) and the one obtained from the tree construction, respectively. To do this define

$$W_n(R^*_0) = \sum_{(i_1,\dots,i_n) \in A_n} R^*_{0,(i_1,\dots,i_n)} C^{(n)}_{(i_1,\dots,i_n)},$$

where $R^*_{0,(\cdot)}$ are iid copies of the initial condition $R^*_0$, independent of the weights $C^{(\cdot)}_{(\cdot)}$, and the weights are the ones defined in Section 2.1. In words, $W_n(R^*_0)$ is the sum of all the weights in the $n$th generation of the tree with the coefficients $Q^{(n)}_{(\cdot)}$ substituted by the corresponding $R^*_{0,(\cdot)}$. We claim that

$$R^*_n = R^{(n-1)} + W_n(R^*_0).$$
To see this note that for $n = 1$,

$$R^*_1 = Q_0 + \sum_{i=1}^{N_0} C_{0,i} R^*_{0,i} = Q^{(0)} C^{(0)} + \sum_{i=1}^{N^{(0)}} C^{(1)}_i R^*_{0,i} = R^{(0)} + W_1(R^*_0) \qquad (\text{recall } C^{(0)} = 1),$$

and by induction in $n$,

$$R^*_{n+1} = Q_n + \sum_{i=1}^{N_n} C_{n,i} R^*_{n,i} = Q_n + \sum_{i=1}^{N_n} C_{n,i} \left( R^{(n-1)}_i + W_{n,i}(R^*_0) \right) \quad (\text{by induction})$$

$$= Q^{(0)} + \sum_{i=1}^{N^{(0)}} C^{(1)}_i R^{(n-1)}_i + \sum_{i=1}^{N^{(0)}} C^{(1)}_i W_{n,i}(R^*_0) = R^{(n)} + W_{n+1}(R^*_0),$$

where $R^{(n-1)}_i$ corresponds to the process obtained from the tree starting with individual $i$ in the first generation (a descendant of the root) and ending on the $n$th generation, and $(R^{(n-1)}_i, W_{n,i}(R^*_0))$ are iid copies of $(R^{(n-1)}, W_n(R^*_0))$; the last equality uses recursion (2.3) and the corresponding recursion for the generation weights. Since $R^{(n)} \to R$ a.s., it will follow from Slutsky's Theorem (see Theorem 1, p. 254 in [12]) that if $W_n(R^*_0) \Rightarrow 0$, then

$$R^*_n \Rightarrow R,$$

where $\Rightarrow$ denotes convergence in distribution. The proof of this convergence and that of the finiteness of $R$ are given in Section 3.1. Understanding the asymptotic properties of the distribution of $R$, as defined by (2.2), is the main objective of this paper.



2.3. Related Processes 

As we mentioned earlier, the stochastic equation defined in (1.1) leads to the analysis of a process known in the literature as a weighted branching process (WBP). WBPs were introduced by Rösler [30] in a construction that is more general than ours. More precisely, each individual in the tree has potentially an infinite number of offspring, and each offspring inherits a certain (nonnegative) weight from its parent and multiplies it by a factor $T_i$, where the index $i$ refers to its birth order (i.e., a first born multiplies its inheritance by $T_1$, a second born by $T_2$, etc.). Each individual branches independently, using an independent copy of the sequence $T_1, T_2, \dots$. However, within the sequence, $T_1, T_2, \dots$ can be dependent. Only individuals whose weights are different than zero are considered to be alive. The construction we give in this paper would correspond to having

$$T_i = C_i 1(N \ge i).$$

The definition of a WBP described above leads to the following stochastic recursion for the total weight of the $n$th generation,

$$W_n = \sum_{i=1}^{\infty} T_i W_{n-1,i} \qquad (2.4)$$

and a corresponding nonhomogeneous fixed point equation of the form

$$R \stackrel{D}{=} \sum_{i=1}^{\infty} T_i R_i + Q. \qquad (2.5)$$
In the construction given in [30], the $\{T_i\}$ and $Q$ are allowed to be dependent as well.

We now briefly describe some of the existing literature on WBPs, most of which considers the homogeneous equation, i.e. $Q \equiv 0$. The nonhomogeneous equation has only been studied for the special case when $Q$ and the $\{T_i\}$ are deterministic constants. In particular, Theorem 5 of [30] analyzes the solutions to (2.5) when $Q$ and the $\{T_i\}$ are nonnegative deterministic constants, which implies that $T_i \le 1$ for all $i$ and $\sum_i T_i^\alpha \log T_i \le 0$ for all $\alpha > 0$, falling outside of the scope of this paper. More results about the solutions to (2.5) for the case when $Q$ and the $T_i$'s are real valued deterministic constants were derived in [2].

Regarding the homogeneous equation, in [30], the martingale structure of $W_n/m^n$ ($m = E[\sum_i T_i]$) was used to point out the existence of $W = \lim_{n \to \infty} W_n/m^n$, and it was shown that positive stable distributions with $\alpha \in (0, 2)$ arise when $E[\sum_i T_i^\alpha] = 1$ and some additional moment conditions are satisfied. Furthermore, for a detailed analysis of the case when $W$ follows a positive stable distribution ($0 < \alpha < 1$) see [2BJ. The convergence of $W_n/m^n$ to $W$ was studied in [35], and conditions for $W$ to belong to the domain of attraction of an $\alpha$-stable law ($1 < \alpha < 2$) were given in [35], along with an analysis of the rate of convergence. A generalization of the WBP described in [30] to a random environment was given in [24], where necessary and sufficient conditions for $W$ to be nondegenerate were derived. The existence of moments of $W$ was studied in PQ. The power law tail of $W$ for the critical case $E[\sum_{i=1}^{N} C_i^\alpha] = 1$ and $\alpha > 1$ was derived in Theorem 2.2 of [27] and Proposition 7 of [18]. For an even longer list of references to WBPs and related work see [24] and ®-

From the discussion above it is clear that the prior literature on WBPs is extensive, but we point out that the more specific structure of our model, given by (1.1), as well as our novel analysis via implicit renewal theory, allow us to characterize the asymptotic power law behavior of the distribution of $R$ for all $\alpha > 0$ when the $\{C_i\}$ dominate the tail. In addition, we study the nonhomogeneous equation (2.5), while the preceding work primarily focuses on the homogeneous case (2.4). The case when $N$ dominates the tail, which is important for the page ranking problem, has not been considered until very recently in [35] and [33]. In reference to the latter work, our analysis is based on a new sample path approach, while the studies in [35, 34] use transforms and Tauberian theorems as well as somewhat different assumptions. We will provide more details on these connections throughout the paper in remarks after the corresponding theorems.

From a different mathematical perspective, our model also constitutes a generalization of several important types of processes. For instance, by setting $N \equiv 1$, (2.1) reduces to an autoregressive process of order one. Also, by letting $N$ be a Poisson random variable and fixing $C_i \equiv 1$, $Q \equiv 1$, (1.1) becomes the recursion that the number of customers in a busy period of an M/G/1 queue satisfies. Recursion (1.2) and its connection to the busy period when the weights $D_i$ are equal to a deterministic constant was exploited in [25].

It is worth noting that probabilistic sample path approaches for the busy period ($C_i \equiv 1$, $Q \equiv 1$) were developed in (35] HH [Z]; the work in JSHJ US] also relies on the theory of cycle maximum [3]. However, for our more general model (random $C_i$'s) it is not clear if there is a tractable way of generalizing this analysis. Instead of pursuing the preceding directions, we develop a direct sample path large deviation analysis for recursive random sums that provides greater generality.



3. Moments of $W_n$

In this section we provide explicit estimates for the moments of the total weight, $W_n$, of the $n$th generation that will be used throughout the paper. In particular, we apply these estimates in Section 3.1 to prove that $R^*_n \Rightarrow R$ where $R < \infty$ a.s. Our estimates may be of independent interest due to their explicit nature.

A simple calculation shows that provided $E[N], E[Q], E[C] < \infty$, then $E[W_n] < \infty$ and is given by

$$E[W_n] = E[N]E[C]E[W_{n-1}] = (E[N]E[C])^n E[W_0] = (E[N]E[C])^n E[Q].$$
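
As a worked consequence of the mean formula (a side computation, writing $\rho = E[N]E[C]$ and assuming $\rho < 1$), summing over generations and using monotone convergence gives the mean of the total weight of the tree:

```latex
E\big[R^{(n)}\big] = \sum_{k=0}^{n} E[W_k]
                   = E[Q] \sum_{k=0}^{n} \rho^{k}
                   = E[Q]\,\frac{1-\rho^{\,n+1}}{1-\rho}
\;\longrightarrow\; \frac{E[Q]}{1-\rho} = E[R]
\quad (n \to \infty), \qquad \rho = E[N]E[C] < 1 .
```

For instance, with $E[Q] = 1$ and $\rho = 1/2$ this yields $E[R] = 2$.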
We give below upper bounds on the general moments of $W_n$.

Throughout the paper we will use $K$ to denote a large positive constant that may be different in different places, say $K = K/2$, $K = K^2$, etc.

Lemma 3.1. Suppose $E[Q^\beta]E[N]E[C^\beta] < \infty$ for $0 < \beta \le 1$, then

$$E[W_n^\beta] \le (E[C^\beta]E[N])^n E[Q^\beta]$$

for all $n \ge 0$.

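Proof. Simply note that

$$E[W_n^\beta] = E\left[ \left( \sum_{i=1}^{N} C_i W_{n-1,i} \right)^{\beta} \right] \qquad (3.1)$$

and use the well known inequality $\left( \sum_{i=1}^{k} y_i \right)^{\beta} \le \sum_{i=1}^{k} y_i^{\beta}$ for $0 < \beta \le 1$, $y_i \ge 0$ (see e.g., Exercise 4.2.1, p. 102, in [12]). □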
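
The subadditivity inequality used in this proof, $(\sum_i y_i)^\beta \le \sum_i y_i^\beta$ for $0 < \beta \le 1$ and $y_i \ge 0$, is easy to check numerically. The sketch below is only a sanity check on random inputs, not part of the argument; the helper name `subadditivity_gap` and the sampling choices are assumptions made for the illustration.

```python
import random


def subadditivity_gap(ys, beta):
    """Return sum(y_i^beta) - (sum y_i)^beta; nonnegative for 0 < beta <= 1."""
    return sum(y ** beta for y in ys) - sum(ys) ** beta


rng = random.Random(0)
gaps = []
for _ in range(1000):
    # a random number of nonnegative terms and a random exponent in (0, 1]
    ys = [rng.expovariate(1.0) for _ in range(rng.randint(1, 10))]
    beta = rng.uniform(0.05, 1.0)
    gaps.append(subadditivity_gap(ys, beta))
worst = min(gaps)   # should be nonnegative up to floating-point rounding
```

For example, `subadditivity_gap([1.0, 1.0], 0.5)` equals $2 - \sqrt{2}$, the gap between $1^{1/2} + 1^{1/2}$ and $(1+1)^{1/2}$.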



The lemma for moments greater than one is given below. 

Lemma 3.2. Suppose $E[Q^\beta] < \infty$, $E[N^\beta] < \infty$, and $E[N] \max\{E[C^\beta], E[C]\} < 1$ for some $\beta > 1$. Then, there exists a constant $K_\beta > 0$ such that

$$E[W_n^\beta] \le K_\beta \left( E[N] \max\{E[C^\beta], E[C]\} \right)^n$$

for all $n \ge 0$.

The proof of Lemma 3.2 is given in Section 7.1.

Remark: Recall that when $C \equiv 1$ and $Q \equiv 1$ then $E[W_n^\beta]$ is the $\beta$-moment of a subcritical branching process $Z_n$ and our result reduces to $E[Z_n^\beta] \le K_\beta (E[N])^n$, which is in agreement with the classical results from branching processes, e.g. see Corollary 1 on p. 18 of [5]. Moreover, from the proof of the integer $\beta$ case (given in Section 7.1), it is clear that $E[W_n^\beta]$ scales as $\rho^{\beta n}$ if $\rho^\beta > \rho_\beta$ and as $\rho_\beta^n$ if $\rho^\beta < \rho_\beta$, where $\rho = E[N]E[C]$ and $\rho_\beta = E[N]E[C^\beta]$. Note that this is not quite the same as our upper bounds, and the reason we choose the geometric term $(\rho \vee \rho_\beta)^n$ instead is that it makes the proofs simpler and is sufficient for our purposes. Similar techniques to those used in proving the preceding lemmas can yield, with some additional work, lower bounds for the $\beta$-moments of $W_n$, showing that the correct leading term is $(\rho^\beta \vee \rho_\beta)^n$.

More technical results dealing with the existence of the $\beta$-moments of $W = \lim_{n \to \infty} W_n/\rho^n$ can be found in PQ. There, necessary and sufficient conditions are given for the finiteness of $E[W^\beta L(W)]$ when $\beta \ge 1$ and $L(\cdot)$ is slowly varying (see Theorems 1.2 and 1.3). In particular, the approach the authors take is to first normalize the process so that $\rho = E[W_1] = 1$, and then impose a condition that in our case reduces to $\rho_\beta = E[N]E[C^\beta] < 1$, that is, they preclude the situation where $E[W_n^\beta]$ might scale as $\rho_\beta^n$ when $\rho^\beta < \rho_\beta$. An example where $E[W_n^\beta]$ scales as $\rho_\beta^n$ is when $N \equiv 1$, since then $W_n^\beta = Q^\beta \prod_{i=1}^{n} C_i^\beta$.

Furthermore, observe that when $\rho = 1$ and $\rho_\beta < 1$ for $\beta > 1$, our proof of the lemma shows that $\limsup_{n \to \infty} E[W_n^\beta] < \infty$, but it does not converge to zero, which is in agreement with [1]. However, since we study $R$, it is necessary to have $\rho < 1$ for the finiteness of $E[R^\beta]$. Otherwise, if $\rho = 1$, $\rho_\beta < 1$, $\beta > 1$, then $E[R^{(n)}] = (n+1)E[Q]$, which by monotone convergence and (2.2) implies that $E[R] = \infty$, and therefore, by convexity, $E[R^\beta] = \infty$.
3.1. Convergence of $R^*_n$ and finiteness of $R$

As discussed in Section 2.2 there are two issues regarding the process $R^*_n$ that remain to be addressed. One is the proof that

$$R^*_n \Rightarrow R = \sum_{k=0}^{\infty} W_k$$

for any initial condition $R^*_0$; the other one is the finiteness of $R$. The lemma below shows that $R < \infty$ a.s.

Lemma 3.3. Suppose that $E[Q^\beta] < \infty$, $E[N^\beta] < \infty$, and either $E[N]E[C^\beta] < 1$ for some $0 < \beta < 1$, or $E[N] \max\{E[C], E[C^\beta]\} < 1$ for some $\beta \ge 1$. Then, $E[R^\gamma] < \infty$ for all $0 < \gamma \le \beta$, and in particular, $R < \infty$ a.s. Moreover, if $\beta \ge 1$, $R^{(n)} \to R$ in $L_\beta$, where $L_\beta$ stands for convergence in the $(E|\cdot|^\beta)^{1/\beta}$ norm.
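Proof. Let

$$\eta = \begin{cases} E[N]E[C^\beta], & \text{if } \beta < 1, \\ E[N] \max\{E[C], E[C^\beta]\}, & \text{if } \beta \ge 1. \end{cases}$$

Then by Lemmas 3.1 and 3.2,

$$E[W_n^\beta] \le K \eta^n \qquad (3.2)$$

for some $K > 0$. Suppose $\beta \ge 1$, then, by monotone convergence and Minkowski's inequality,

$$E[R^\beta] = E\left[ \lim_{n \to \infty} \left( \sum_{k=0}^{n} W_k \right)^{\beta} \right] = \lim_{n \to \infty} E\left[ \left( \sum_{k=0}^{n} W_k \right)^{\beta} \right] \le \lim_{n \to \infty} \left( \sum_{k=0}^{n} E[W_k^\beta]^{1/\beta} \right)^{\beta} \le K \left( \sum_{k=0}^{\infty} \eta^{k/\beta} \right)^{\beta} < \infty.$$

This implies that $R < \infty$ a.s. When $0 < \beta < 1$ use the inequality $\left( \sum_{k=0}^{n} y_k \right)^{\beta} \le \sum_{k=0}^{n} y_k^{\beta}$ for any $y_i \ge 0$ instead of Minkowski's inequality. Furthermore, for any $0 < \gamma < \beta$,

$$E[R^\gamma] = E\left[ (R^\beta)^{\gamma/\beta} \right] \le (E[R^\beta])^{\gamma/\beta} < \infty.$$

That $R^{(n)} \to R$ in $L_\beta$ whenever $\beta \ge 1$ follows from noting that $E[|R^{(n)} - R|^\beta] = E\left[ \left( \sum_{k=n+1}^{\infty} W_k \right)^{\beta} \right]$ and applying the same arguments used above to obtain the bound $E[|R^{(n)} - R|^\beta] \le K \eta^{n+1} / (1 - \eta^{1/\beta})^{\beta}$. □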



Next, by monotone convergence in equation (2.3) it can be verified that $R$ must solve

$$R \stackrel{D}{=} \sum_{i=1}^{N} C_i R_i + Q,$$

where $\{R_i\}_{i \ge 1}$ are iid copies of $R$, independent of $N$, $Q$, and $\{C_i\}$.

We now turn our attention to the proof of the convergence of $R^*_n$ to $R$. Recall from Section 2.2 that

$$R^*_n = R^{(n-1)} + W_n(R^*_0), \qquad (3.3)$$

where

$$W_n(R^*_0) = \sum_{(i_1,\dots,i_n) \in A_n} R^*_{0,(i_1,\dots,i_n)} C^{(n)}_{(i_1,\dots,i_n)}.$$

The following lemma shows that $R^*_n \Rightarrow R$ for any initial condition $R^*_0$ satisfying a moment assumption.
Lemma 3.4. For any $R^*_0 \ge 0$, if $E[Q^\beta] < \infty$, $E[(R^*_0)^\beta] < \infty$ and $E[N]E[C^\beta] < 1$ for some $0 < \beta \le 1$, then

$$R^*_n \Rightarrow R,$$

with $E[R^\beta] < \infty$. Furthermore, under these assumptions, the distribution of $R$ is the unique solution with finite $\beta$-moment to recursion (1.1).

Proof. In view of (3.3), and since $R^{(n)} \to R$ a.s., the result will follow from Slutsky's Theorem (see Theorem 1, p. 254 in [12]) once we show that $W_n(R^*_0) \Rightarrow 0$. Recall that $W_n(R^*_0)$ is the same as $W_n$ if we substitute the $Q^{(n)}_{(i_1,\dots,i_n)}$ by the $R^*_{0,(i_1,\dots,i_n)}$. Fix $\epsilon > 0$, then

$$P(W_n(R^*_0) > \epsilon) \le \epsilon^{-\beta} E[W_n(R^*_0)^\beta] \le \epsilon^{-\beta} (E[C^\beta]E[N])^n E[(R^*_0)^\beta] \qquad (\text{by Lemma 3.1}).$$

Since by assumption the right hand side converges to zero as $n \to \infty$, then $R^*_n \Rightarrow R$. Furthermore, $E[R^\beta] < \infty$ by Lemma 3.3. Clearly, the distribution of $R$ represents the unique solution with finite $\beta$-moment to (1.1), since any other possible solution would have to converge to the same limit. □

Remarks: (i) Note that when $E[N] < 1$, then the branching tree is a.s. finite and no conditions on the $C$'s are necessary for $R < \infty$ a.s. This corresponds to the second condition in Theorem 1 of [10]. (ii) In view of the same theorem from [10], one could possibly establish the convergence of $R^*_n \Rightarrow R < \infty$ under milder conditions. However, since the conditions that we will impose on $N$, $Q$ and $C$ in the main theorems will be stronger, this lemma is not restrictive. Furthermore, the initial values, $R^*_0$, are typically small (e.g. constant in applications), and thus the polynomial moment condition imposed on $R^*_0$ is general enough.



4. The case when the C's dominate: Implicit renewal theory 

In this section we study the power law phenomenon that arises from the multiplicative effects of the weights $\{C_i\}$ in (1.1).

4.1. Implicit Renewal Theorem on Trees 

One observation that will help gain some intuition about (2.3) is to consider the case when $N \equiv 1$. The process $\{R^{(n)}\}$ then reduces to a (random coefficient) autoregressive process of order one, whose steady state solution satisfies

$$R \stackrel{D}{=} Q + CR,$$

where $R$ is independent of $C$ and $Q$. This is precisely one of the stochastic recursions considered in [16] (see also jH]), where it is shown that under the assumption that $E[C^\alpha] = 1$ and some other technical conditions on the distribution of $C$ and $Q$, we have that

$$P(R > x) \sim H x^{-\alpha} \qquad (4.1)$$

for some (computable) constant $H > 0$ (see Theorem 4.1 in [16]). The fact that the index of the power law depends on the distribution of the weights is already promising in terms of our goal of identifying other sources of power law behavior.

Informally speaking, the recursions studied in [16] are basically multiplicative away from the boundary.
However, (1.1) always has an additive component given by $\sum_{i=1}^N C_i R_i$ regardless of how far from the boundary
one may be. Fortunately, due to the heavy-tailed nature of $R$, our intuition says that it is only one of the
additive $C_i R_i$ components that determines the behavior of (1.1); thus the sum will behave as the maximum
term, simplifying to

$$P\left( Q + \sum_{i=1}^N C_i R_i > x \right) \sim E[N] P(CR > x), \qquad (4.2)$$
assuming that $Q$ has a light enough tail. This heuristic suggests the following generalization of Theorem 2.3
from [16].

Here, we would like to emphasize that $R$ and $C$ in the following theorem can be any two independent random
variables that satisfy the stated conditions, i.e., they do not have to be related by recursion (1.1). Hence,
the theorem may be of potential use in other applications. Note that we prove the theorem for a general
constant $m$, which in our application refers to $E[N]$, as suggested by (4.2).



Theorem 4.1. Suppose $C \ge 0$ a.s., $0 < E[C^\alpha \log C] < \infty$ for some $\alpha > 0$, and that the conditional
distribution of $\log C$ given $C \ne 0$ is nonarithmetic. Suppose further that $R$ is independent of $C$, $mE[C^\alpha] = 1$,
and that $E[R^\beta] < \infty$ for any $0 < \beta < \alpha$. If

$$\int_0^\infty \left| P(R > t) - mP(CR > t) \right| t^{\alpha-1}\, dt < \infty, \qquad (4.3)$$

then

$$P(R > t) \sim H t^{-\alpha}, \qquad t \to \infty,$$

where $H \ge 0$ is given by

$$H = \frac{1}{mE[C^\alpha \log C]} \int_0^\infty v^{\alpha-1} \left( P(R > v) - mP(CR > v) \right) dv.$$



The proof of this theorem follows the same steps as Theorem 2.3 from [16], and is presented in Section 7.2.

Remarks: (i) As pointed out in [16], the statement of the theorem has content only when $R$ has infinite
moment of order $\alpha$, since otherwise the constant $H = (\alpha E[N]E[C^\alpha \log C])^{-1} (E[R^\alpha] - E[N]E[(CR)^\alpha])$ will
be zero by independence of $R$ and $C$. (ii) Note that some of the assumptions of Theorem 4.1 are different
from the corresponding ones in Theorem 2.3 of [16]. In particular, it is no longer the case that convexity
implies $E[C^\alpha \log C] > 0$ whenever $\alpha$ solves $mE[C^\alpha] = 1$ and $E[C^\alpha \log C] < \infty$, since if $m > 1$ it is possible
to construct counterexamples, hence the need to include this as an assumption. Another difference is our
requirement that $E[R^\beta] < \infty$ for any $0 < \beta < \alpha$. In the case of applying Theorem 4.1 to equation (1.1),
the condition on $E[R^\beta]$ is not restrictive since we readily obtain the moments of $R$ for $0 < \beta < \alpha$ from the
computed moments of $W_n$ from Section 3. (iii) A similar result for the case when $\log C$ is lattice valued can
be derived using the corresponding renewal theorem.
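
The index $\alpha$ in Theorem 4.1 is defined only implicitly, through $mE[C^\alpha] = 1$. As a purely illustrative sketch (not part of the original analysis), the snippet below locates $\alpha$ by bisection for a weight $C$ with a finite discrete distribution and also evaluates $E[C^\alpha \log C]$, which the theorem requires to be positive. The function names, the discrete distribution and the bracketing interval are assumptions made for the example.

```python
import math
from typing import Sequence


def moment(values: Sequence[float], probs: Sequence[float], theta: float) -> float:
    """E[C^theta] for a discrete, strictly positive weight C (hypothetical helper)."""
    return sum(p * v ** theta for v, p in zip(values, probs))


def log_moment(values: Sequence[float], probs: Sequence[float], theta: float) -> float:
    """E[C^theta log C], the derivative condition appearing in Theorem 4.1."""
    return sum(p * v ** theta * math.log(v) for v, p in zip(values, probs))


def solve_alpha(values, probs, m, lo=1e-6, hi=50.0, tol=1e-12):
    """Bisection for the positive root of f(a) = m * E[C^a] - 1.

    Assumes the caller supplies a bracket with f(lo) < 0 < f(hi); this holds,
    for instance, when m = 1, E[log C] < 0 and P(C > 1) > 0.
    """
    def f(a: float) -> float:
        return m * moment(values, probs, a) - 1.0

    if not (f(lo) < 0.0 < f(hi)):
        raise ValueError("bracket [lo, hi] does not straddle a root")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Illustrative input: C = 1/4 or 2 with equal probability, m = 1.
values, probs = [0.25, 2.0], [0.5, 0.5]
alpha = solve_alpha(values, probs, m=1.0)
derivative_condition = log_moment(values, probs, alpha)
```

For this particular two-point distribution the equation reduces to a cubic in $2^\alpha$, whose relevant root gives $\alpha = \log_2((1+\sqrt{5})/2)$, so the numerical answer can be checked by hand.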

In what follows we will use the preceding theorem to derive the asymptotic behavior of $P(R > x)$ where $R$,
as given by (2.2), satisfies (1.1). Here, the main difficulty will be to show that condition (4.3) holds. For
brevity we use $x \vee y$ to denote $\max\{x, y\}$ and $x \wedge y$ to denote $\min\{x, y\}$.



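Theorem 4.2. Suppose that $0 < E[C^\alpha \log C] < \infty$ for some $\alpha > 0$, the conditional distribution of $\log C$
given $C \ne 0$ is nonarithmetic, and that $C$ and $R$ are independent, where $R$ is defined by (2.2). Assume that
$E[N]E[C^\alpha] = 1$, $0 < E[Q^\alpha] < \infty$ and $E[N^{\alpha \vee (1+\epsilon)}] < \infty$ for some $0 < \epsilon < 1$; if $\alpha > 1$ assume further that
$E[N]E[C] < 1$. Then,

$$P(R > t) \sim H t^{-\alpha}, \qquad t \to \infty,$$

where

$$H = \frac{1}{E[N]E[C^\alpha \log C]} \int_0^\infty v^{\alpha-1} \left( P(R > v) - E[N]P(CR > v) \right) dv = \frac{E\left[ \left( \sum_{i=1}^N C_i R_i + Q \right)^\alpha - \sum_{i=1}^N (C_i R_i)^\alpha \right]}{\alpha E[N]E[C^\alpha \log C]}.$$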
Remarks: (i) Note that the second expression for $H$ is more suitable for actually computing it, especially
in the case of $\alpha$ being an integer, as will be stated in the forthcoming corollary. (ii) When $\alpha$ is not an
integer we can derive an explicit bound on $H$ by using the forthcoming Lemma 4.3 and (4.5). (iii) For the
homogeneous equation ($Q = 0$) and $\alpha > 1$, closely related results to our theorem can be found in Theorem
2.2 of [57] and Proposition 7 of [15]. The approach from [57] transforms the recursion $W \stackrel{D}{=} \sum_{i=1}^N C_i W_i$
for the critical case $E[W] = 1$, $E\left[\sum_{i=1}^N C_i\right] = 1$ to a first order difference (autoregressive) equation on a
different probability space, see Lemma 4.1 in [57]. Note that the tail behavior of $W$ does not imply that of
$R$. Furthermore, it appears that the method from [57] does not extend to the nonhomogeneous case since
the proof of Lemma 4.1 in [57] critically depends on having both $E[W] = 1$ and $E\left[\sum_{i=1}^N C_i\right] = 1$, which is
only possible when $Q = 0$. For $0 < \alpha < 1$, the homogeneous equation was studied in [55] using stable laws.
(iv) Related results for the nonhomogeneous equation with deterministic constants $Q$, $\{C_i\}$, $N = \infty$, have
been considered in [50] (see Theorem 5), and more recently in [5], also using stable laws. (v) Moreover,
the results obtained in the references cited above appear to be less explicit in the expression for $H$ than
the statement of Theorem 4.2, as Corollary 4.1 below illustrates. (vi) Furthermore, Theorem 4.1 and the
preceding technique of Theorem 4.2 can be adapted to analyze other, possibly non-linear, recursions on trees,
e.g., one can characterize the asymptotic behavior of $P(R > x)$ that solves

$$R \stackrel{D}{=} Q + \max_{1 \le i \le N} C_i R_i.$$

We also want to point out that one can obtain the logarithmic asymptotics of $R$, that is, the behavior of
$\log P(R > x)$, much more easily and under less restrictive conditions, e.g. $\log C_i$ need not be nonarithmetic (this
condition is required because of the use of the Renewal Theorem). An upper bound can be obtained from
Lemma 3.3 and Markov's inequality. For the lower bound, using the notation from Section 2.1, we obtain

$$P(R > x) \ge P(W_n > x) \ge P\left( \max_{1 \le i \le N} C_i W_{n-1,i} > x \right) = E\left[ 1 - P(CW_{n-1} \le x)^N \right] \ge E\left[ N P(CW_{n-1} \le x)^N \right] P(CW_{n-1} > x),$$

where in the last step we used the relation $1 - t^m \ge m t^m (1 - t)$ for $0 \le t \le 1$. Now we use the fact that
$P(CW_{n-1} \le x) \ge P(R \le x)$, for all $x$, to show that

$$P(R > x) \ge E\left[ N P(R \le x)^N \right] P(CW_{n-1} > x)$$
$$\ge E\left[ N P(R \le x)^N \right] P\left( C_1 \max_{1 \le i \le N} C_{2,i} W_{n-2,i} > x \right)$$
$$\ge E\left[ N P(R \le x)^N \right] E\left[ N P(C_1 C_2 W_{n-2} \le x)^N \right] P(C_1 C_2 W_{n-2} > x),$$

which, by using $P(C_1 C_2 W_{n-2} \le x) \ge P(R \le x)$, for all $x$, yields

$$P(R > x) \ge \left( E\left[ N P(R \le x)^N \right] \right)^2 P(C_1 C_2 W_{n-2} > x).$$

Next, by continuing this inductive argument we obtain

$$P(R > x) \ge \left( E\left[ N P(R \le x)^N \right] \right)^n P\left( Q \prod_{i=1}^n C_i > x \right).$$

Finally, for any $0 < \epsilon < 1$, we can choose $x_0$ such that $E\left[ N P(R \le x_0)^N \right] \ge (1 - \epsilon)E[N]$, implying that for
all $n \ge 1$ and $x \ge x_0$,

$$P(R > x) \ge (1-\epsilon)^n (E[N])^n P\left( Q \prod_{i=1}^n C_i > x \right) \ge P(Q > 1/\log x)\, (1-\epsilon)^n (E[N])^n P\left( \prod_{i=1}^n C_i > x \log x \right).$$
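Now define $S_n = \log C_1 + \cdots + \log C_n$, $\kappa(\theta) = \log E[C^\theta]$, and choose $\alpha$ to be the solution to $\kappa(\alpha) = -\log E[N]$
(i.e. $E[N]E[C^\alpha] = 1$). The proof can be completed by choosing $n = n(x) = \log(x \log x)/\mu_\alpha$, where
$\mu_\alpha = \kappa'(\alpha) = E[C^\alpha \log C]/E[C^\alpha] > \kappa'(0) = E[\log C]$ by convexity of $\kappa(\cdot)$. Then, by Theorem 2.1 in Chapter
XIII in [4],

$$\liminf_{x \to \infty} \frac{\log P(R > x)}{\log x} \ge \frac{\log((1-\epsilon)E[N])}{\mu_\alpha} + \liminf_{x \to \infty} \frac{\log P(S_{n(x)} > \mu_\alpha n(x))}{\mu_\alpha n(x)} = -\alpha + \frac{\log(1 - \epsilon)}{\mu_\alpha}.$$

Here the second limit equals $-\alpha - \log E[N]/\mu_\alpha$, since the large deviations rate of $S_n$ at $\mu_\alpha$ is $\alpha \mu_\alpha - \kappa(\alpha)$; letting $\epsilon \downarrow 0$ gives the lower bound $-\alpha$.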

Hence, one can derive with a considerably smaller effort the following theorem. 

Theorem 4.3. Suppose that $0 < E[C^\alpha \log C] < \infty$ for some $\alpha > 0$, and that $R$ is given by (2.2). Assume
that $E[N]E[C^\alpha] = 1$, $0 < E[Q^\alpha] < \infty$, $E[N^{\alpha \vee 1}] < \infty$; if $\alpha > 1$ assume further that $E[N]E[C] < 1$. Then,

$$\log P(R > t) \sim -\alpha \log t, \qquad t \to \infty.$$



Therefore, the majority of the work in proving Theorem 4.2 goes into the derivation of the exact asymptotic.
Furthermore, it is worth noting that the logarithmic approach, although less precise, can be obtained in a 
more general setting. For example, one can have to be dependent across different generations, as in 
the so called WBP in a random environment. Here, one could derive the asymptotics of log P(R > x) if 

satisfies the polynomial type Gartner-Ellis conditions that were recently considered 



E 



(nr=i c (i,_ 



in [20] 

Corollary 4.1. For integer $\alpha \ge 1$, and under the same assumptions of Theorem 4.2, the constant $H$ can be
explicitly computed as a function of $E[R^k]$, $E[C^k]$, $E[Q^k]$, $0 \le k \le \alpha - 1$. In particular, for $\alpha = 1$,

$$H = \frac{E[Q]}{E[N]E[C \log C]},$$

and for $\alpha = 2$,

$$H = \frac{E[Q^2] + 2E[Q]E[C]E[N]E[R] + E[N(N-1)](E[C]E[R])^2}{2E[N]E[C^2 \log C]}, \qquad E[R] = \frac{E[Q]}{1 - E[N]E[C]}.$$

Proof. The proof follows directly from multinomial expansions of the second expression for $H$ in Theorem 4.2. □
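
As a small illustration of Corollary 4.1, the two closed forms for $H$ can be evaluated directly once the required moments are known. The sketch below is only arithmetic on user-supplied moments: the function and argument names are hypothetical, and the code does not check that the inputs come from a distribution satisfying $E[N]E[C^\alpha] = 1$; that consistency is assumed to be the caller's responsibility.

```python
def tail_constant_alpha1(EQ: float, EN: float, EClogC: float) -> float:
    """H for alpha = 1: E[Q] / (E[N] E[C log C])."""
    return EQ / (EN * EClogC)


def tail_constant_alpha2(EQ: float, EQ2: float, EC: float, EN: float,
                         ENN1: float, EC2logC: float) -> float:
    """H for alpha = 2, using E[R] = E[Q] / (1 - E[N] E[C]).

    ENN1 stands for E[N(N-1)] and EC2logC for E[C^2 log C].
    """
    rho = EN * EC
    if not rho < 1.0:
        raise ValueError("the case alpha > 1 requires E[N] E[C] < 1")
    ER = EQ / (1.0 - rho)
    numerator = EQ2 + 2.0 * EQ * EC * EN * ER + ENN1 * (EC * ER) ** 2
    return numerator / (2.0 * EN * EC2logC)


# Illustrative moment values (not taken from the paper).
H1 = tail_constant_alpha1(EQ=1.0, EN=2.0, EClogC=0.25)
H2 = tail_constant_alpha2(EQ=1.0, EQ2=2.0, EC=0.25, EN=2.0, ENN1=2.0, EC2logC=0.5)
```

With these illustrative inputs, $E[R] = 2$ and the numerator for $\alpha = 2$ is $2 + 2 + 0.5 = 4.5$, which makes the arithmetic easy to verify by hand.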



Before giving the proof of Theorem 4.2 we state the following three preliminary lemmas. The proof of
Lemma 4.2 is given in Section 7.1 and the proof of Lemma 4.3 is given in Section 7.2.



Lemma 4.1. Suppose that $0 < E[C^\alpha \log C] < \infty$ for some $\alpha > 0$ and $E[N]E[C^\alpha] = 1$; if $\alpha > 1$ suppose
further that $E[N]E[C] < 1$. Assume also that $E[Q^\alpha] < \infty$, $E[N^{\alpha \vee 1}] < \infty$. Then,

$$E[R^\beta] < \infty$$

for all $0 < \beta < \alpha$.

Proof. The derivative condition $0 < E[C^\alpha \log C] < \infty$ and $E[N]E[C^\alpha] = 1$ imply that $E[N]E[C^\beta] < 1$
for all $\beta < \alpha$ that are sufficiently close to $\alpha$. Hence, the conclusion of the result follows from Lemma 3.3. □



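Lemma 4.2. Let $\beta > 1$ and $p = \lceil \beta \rceil \in \{2, 3, 4, \ldots\}$. For any sequence of nonnegative iid random variables
$\{Y, Y_i\}_{i \ge 1}$ and any $k \in \{1, 2, 3, \ldots\}$ we have

$$E\left[ \left( \sum_{i=1}^k Y_i \right)^\beta - \sum_{i=1}^k Y_i^\beta \right] \le k^\beta \left( E[Y^{p-1}] \right)^{\beta/(p-1)}.$$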
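
A quick numerical sanity check of Lemma 4.2 is possible when $Y$ is Bernoulli, because then $\sum_{i=1}^k Y_i$ is binomial and both sides of the inequality can be computed exactly. The sketch below is an illustration under that assumption only; the function name and the parameter choices are hypothetical, and a few passing cases are of course no substitute for the proof in Section 7.1.

```python
import math


def lemma_4_2_sides(k: int, q: float, beta: float):
    """Return (lhs, rhs) of Lemma 4.2 for iid Y_i ~ Bernoulli(q), beta > 1.

    lhs = E[(Y_1 + ... + Y_k)^beta - (Y_1^beta + ... + Y_k^beta)]
    rhs = k^beta * (E[Y^(p-1)])^(beta/(p-1)),  p = ceil(beta)
    """
    p = math.ceil(beta)
    # The sum of k Bernoulli(q) variables is Binomial(k, q).
    e_sum_beta = sum(
        math.comb(k, j) * q ** j * (1.0 - q) ** (k - j) * j ** beta
        for j in range(k + 1)
    )
    # For 0/1-valued Y, Y^beta = Y and E[Y^(p-1)] = q.
    lhs = e_sum_beta - k * q
    rhs = k ** beta * q ** (beta / (p - 1))
    return lhs, rhs


cases = [(1, 0.3, 2.5), (5, 0.3, 2.5), (10, 0.9, 3.0), (4, 0.5, 1.2)]
results = [lemma_4_2_sides(k, q, beta) for k, q, beta in cases]
```

For $k = 1$ the left-hand side is exactly zero, which matches the fact that the bound only has to control the cross terms of the expansion.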
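Lemma 4.3. Suppose $\{C, C_i\}$ and $\{R, R_i\}$ are iid sequences of nonnegative random variables independent
of each other and of $N$. Assume that $E[C^\alpha] < \infty$, $E[N^{1+\epsilon}] < \infty$ for some $0 < \epsilon < 1$, and $E[R^\beta] < \infty$ for
any $0 < \beta < \alpha$. Then,

$$0 \le \int_0^\infty \left( E[N]P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) t^{\alpha-1}\, dt = \frac{1}{\alpha} E\left[ \sum_{i=1}^N (C_i R_i)^\alpha - \left( \max_{1 \le i \le N} C_i R_i \right)^\alpha \right] < \infty.$$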



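Proof of Theorem 4.2. By Lemma 4.1 we know that $E[R^\beta] < \infty$ for any $0 < \beta < \alpha$. The statement of
the theorem with the first expression for $H$ will follow from Theorem 4.1 once we prove condition (4.3) for
$m = E[N]$. Define

$$R^* = \sum_{i=1}^N C_i R_i + Q.$$

Then,

$$\left| P(R > t) - E[N]P(CR > t) \right| \le \left| P(R > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right| + \left| P\left( \max_{1 \le i \le N} C_i R_i > t \right) - E[N]P(CR > t) \right|.$$

Since $R \stackrel{D}{=} R^* \ge \max_{1 \le i \le N} C_i R_i$, the first absolute value disappears. For the second one note that by the
union bound

$$E[N]P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) = E\left[ N P(CR > t) - 1 + P(CR \le t)^N \right] \ge 0.$$

It follows that

$$\left| P(R > t) - E[N]P(CR > t) \right| \le \left( P(R > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) + \left( E[N]P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right).$$

Note that we only need to verify that

$$\int_0^\infty \left( P(R > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) t^{\alpha-1}\, dt < \infty,$$

since

$$\int_0^\infty \left( E[N]P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) t^{\alpha-1}\, dt < \infty$$

by Lemma 4.3. To see this note that $R \stackrel{D}{=} R^*$ and $1(R^* > t) - 1(\max_{1 \le i \le N} C_i R_i > t) \ge 0$; thus, by Fubini's
Theorem, we have

$$\int_0^\infty \left( P(R > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) t^{\alpha-1}\, dt = \frac{1}{\alpha} E\left[ (R^*)^\alpha - \left( \max_{1 \le i \le N} C_i R_i \right)^\alpha \right]. \qquad (4.4)$$

If $0 < \alpha \le 1$ we apply the inequality $\left( \sum_{i=1}^k x_i \right)^\beta \le \sum_{i=1}^k x_i^\beta$ for $0 < \beta \le 1$, $x_i \ge 0$, to obtain

$$E\left[ (R^*)^\alpha - \left( \max_{1 \le i \le N} C_i R_i \right)^\alpha \right] \le E\left[ Q^\alpha + \sum_{i=1}^N (C_i R_i)^\alpha - \left( \max_{1 \le i \le N} C_i R_i \right)^\alpha \right],$$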
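which is finite by Lemma 4.3 and the assumption $E[Q^\alpha] < \infty$. If $\alpha > 1$ we use the well known inequality
$\left( \sum_{i=1}^k x_i \right)^\alpha \ge \sum_{i=1}^k x_i^\alpha$, $x_i \ge 0$ (see Exercise 4.2.1, p. 102, in [12]) to split the expectation as follows

$$E\left[ (R^*)^\alpha - \left( \max_{1 \le i \le N} C_i R_i \right)^\alpha \right] = E\left[ (R^*)^\alpha - \sum_{i=1}^N (C_i R_i)^\alpha \right] + E\left[ \sum_{i=1}^N (C_i R_i)^\alpha - \left( \max_{1 \le i \le N} C_i R_i \right)^\alpha \right],$$

which can be done since both expressions inside the expectations on the right hand side are nonnegative. The
second expectation is again finite by Lemma 4.3. To see that the first expectation is finite let $S = \sum_{i=1}^N C_i R_i$
and note that $R^* = S + Q$, where $S$ and $Q$ are independent. Let $p = \lceil \alpha \rceil$ and note that $1 \le p - 1 < \alpha$.
Then, by Lemma 4.2,

$$E\left[ (R^*)^\alpha - \sum_{i=1}^N (C_i R_i)^\alpha \right] = E\left[ (S + Q)^\alpha - S^\alpha \right] + E\left[ \left( \sum_{i=1}^N C_i R_i \right)^\alpha - \sum_{i=1}^N (C_i R_i)^\alpha \right]$$
$$\le E\left[ (S + Q)^\alpha - S^\alpha \right] + E[N^\alpha] \left( E[(CR)^{p-1}] \right)^{\alpha/(p-1)}.$$

The second expectation is finite since by Lemma 4.1 $E[R^\beta] < \infty$ for any $0 < \beta < \alpha$. For the first expectation
we use the inequality

$$(x + t)^\kappa \le \begin{cases} x^\kappa + t^\kappa, & 0 < \kappa \le 1, \\ x^\kappa + \kappa (x + t)^{\kappa-1} t, & \kappa > 1, \end{cases}$$

for any $x, t \ge 0$. We apply the second expression $p - 1$ times and then the first one to obtain

$$(x + t)^\alpha \le x^\alpha + \alpha (x + t)^{\alpha-1} t \le \cdots \le x^\alpha + \sum_{i=1}^{p-2} \alpha^i x^{\alpha-i} t^i + \alpha^{p-1} (x + t)^{\alpha-p+1} t^{p-1} \le x^\alpha + \alpha^p t^\alpha + \alpha^p \sum_{i=1}^{p-1} x^{\alpha-i} t^i.$$

We conclude that

$$E\left[ (S + Q)^\alpha - S^\alpha \right] \le \alpha^p E[Q^\alpha] + \alpha^p \sum_{i=1}^{p-1} E[S^{\alpha-i}] E[Q^i], \qquad (4.5)$$

where $E[S^{\alpha-i}] \le E[(R^*)^{\alpha-i}] < \infty$ for any $1 \le i \le p - 1$ by Lemma 4.1.
Finally, applying Theorem 4.1 gives

$$P(R > t) \sim H t^{-\alpha},$$

where $H = (E[N]E[C^\alpha \log C])^{-1} \int_0^\infty v^{\alpha-1} (P(R > v) - E[N]P(CR > v))\, dv$.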
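To obtain the second expression for $H$ note that

$$\int_0^\infty v^{\alpha-1} \left( P(R > v) - E[N]P(CR > v) \right) dv = \int_0^\infty v^{\alpha-1} E\left[ 1\left( \sum_{i=1}^N C_i R_i + Q > v \right) - \sum_{i=1}^N 1(C_i R_i > v) \right] dv$$
$$= E\left[ \int_0^\infty v^{\alpha-1} \left( 1\left( \sum_{i=1}^N C_i R_i + Q > v \right) - \sum_{i=1}^N 1(C_i R_i > v) \right) dv \right] \qquad (4.6)$$
$$= E\left[ \int_0^{\sum_{i=1}^N C_i R_i + Q} v^{\alpha-1}\, dv - \sum_{i=1}^N \int_0^{C_i R_i} v^{\alpha-1}\, dv \right] \qquad (4.7)$$
$$= \frac{1}{\alpha} E\left[ \left( \sum_{i=1}^N C_i R_i + Q \right)^\alpha - \sum_{i=1}^N (C_i R_i)^\alpha \right],$$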
where (4.6) is justified by Fubini's Theorem and the absolute integrability of $v^{\alpha-1} (P(R > v) - E[N]P(CR > v))$,
and (4.7) is justified from the observation that

$$v^{\alpha-1}\, 1\left( \sum_{i=1}^N C_i R_i + Q > v \right) \quad \text{and} \quad v^{\alpha-1} \sum_{i=1}^N 1(C_i R_i > v)$$

are each almost surely absolutely integrable as well. This completes the proof. □

5. The case when N dominates

We now turn our attention to the distributional properties of $R^{(n)}$ and $R$ when $N$ has a heavy-tailed
distribution (in particular, regularly varying) that is heavier than the potential power law effect arising
from the multiplicative weights $\{C_i\}$. This case is particularly important for understanding the behavior of
Google's PageRank algorithm since the $C_i$'s are smaller than one and the in-degree distribution of the Web
graph is well accepted to be a power law. We start this section by stating the corresponding lemma that
describes the asymptotic behavior of $R^{(n)}$. The main technical difficulty of extending this lemma to steady
state ($R = R^{(\infty)}$) is to develop a uniform bound for $R - R^{(n)}$, which is enabled by our main technical result
of this section, Proposition 5.1. The following lemmas are proved in Section 7.3.

Before stating the lemmas, let us recall that a function $L : [0, \infty) \to (0, \infty)$ is slowly varying if $L(\lambda x)/L(x) \to 1$
as $x \to \infty$ for any $\lambda > 0$. We then say that the function $x^{-\alpha} L(x)$ is regularly varying with index $\alpha$.

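Lemma 5.1. Suppose that $P(N > x) = x^{-\alpha} L(x)$ with $L(\cdot)$ slowly varying, $\alpha > 1$, and $E[Q^{\alpha+\epsilon}] < \infty$,
$E[C^{\alpha+\epsilon}] < \infty$ for some $\epsilon > 0$. Let $\rho = E[N]E[C]$ and $\rho_\alpha = E[N]E[C^\alpha]$. Then, for any fixed $n \in \{1, 2, 3, \ldots\}$,

$$P(R^{(n)} > x) \sim \frac{(E[C]E[Q])^\alpha}{(1 - \rho)^\alpha} \sum_{k=0}^n \rho_\alpha^k (1 - \rho^{n-k})^\alpha\, P(N > x) \qquad (5.1)$$

as $x \to \infty$, where $R^{(n)}$ was defined in Section 2.1.

Lemma 5.2. Suppose that $P(N > x) = x^{-\alpha} L(x)$ with $L(\cdot)$ slowly varying, $\alpha > 1$, and $E[Q^{\alpha+\epsilon}] < \infty$,
$E[C^{\alpha+\epsilon}] < \infty$ for some $\epsilon > 0$. Let $\rho = E[N]E[C]$ and $\rho_\alpha = E[N]E[C^\alpha]$. Then, for any fixed $n \in \{1, 2, 3, \ldots\}$,

$$P(W_n > x) \sim (E[C]E[Q])^\alpha \sum_{k=0}^{n-1} \rho_\alpha^k \rho^{(n-1-k)\alpha}\, P(N > x)$$

as $x \to \infty$, where $W_n$ was defined in Section 2.1.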
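
To get a feel for the constant in Lemma 5.1, one can evaluate it for increasing $n$ and compare it with its $n \to \infty$ limit, $(E[C]E[Q])^\alpha (1-\rho)^{-\alpha} (1-\rho_\alpha)^{-1}$, which is what the sum converges to when $\rho \vee \rho_\alpha < 1$. The following sketch does exactly that; the function names and the moment values are illustrative assumptions, not quantities taken from the paper.

```python
def finite_n_constant(n: int, alpha: float, EN: float, EC: float,
                      EC_alpha: float, EQ: float) -> float:
    """Prefactor of P(N > x) in Lemma 5.1 for a fixed number of iterations n."""
    rho = EN * EC            # rho = E[N] E[C]
    rho_a = EN * EC_alpha    # rho_alpha = E[N] E[C^alpha]
    partial = sum(rho_a ** k * (1.0 - rho ** (n - k)) ** alpha for k in range(n + 1))
    return (EC * EQ) ** alpha / (1.0 - rho) ** alpha * partial


def limiting_constant(alpha: float, EN: float, EC: float,
                      EC_alpha: float, EQ: float) -> float:
    """Limit of finite_n_constant as n grows; requires rho < 1 and rho_alpha < 1."""
    rho = EN * EC
    rho_a = EN * EC_alpha
    if not (rho < 1.0 and rho_a < 1.0):
        raise ValueError("need E[N]E[C] < 1 and E[N]E[C^alpha] < 1")
    return (EC * EQ) ** alpha / ((1.0 - rho) ** alpha * (1.0 - rho_a))


# Illustrative moments: rho = 0.5 and rho_alpha = 0.3.
params = dict(alpha=1.5, EN=2.0, EC=0.25, EC_alpha=0.15, EQ=1.0)
sequence = [finite_n_constant(n, **params) for n in range(0, 201)]
limit = limiting_constant(**params)
```

In this example the sequence is nondecreasing in $n$ and approaches the limit geometrically fast, which is the behavior that makes an exchange of the limits in $n$ and $x$ plausible.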

From this result, provided $\rho \vee \rho_\alpha < 1$, it is to be expected that a bound of the form

$$P(W_n > x) \le K \eta^n P(N > x)$$

might hold for all $n$ and $x \ge 1$, for some $\rho \vee \rho_\alpha < \eta < 1$. Such a bound will provide the necessary tools to
ensure that $R - R^{(n)}$ is negligible for large enough $n$, allowing the exchange of limits in Lemma 5.1. Proving
this result is the main technical contribution of this section; the actual proof is given in Section 7.3. This
bound may be of independent interest for computing the distributional properties of other recursions on
branching trees, e.g. it is straightforward to apply our method to study the solution to

$$R \stackrel{D}{=} Q + \max_{1 \le i \le N} C_i R_i,$$

and similar recursions.
Proposition 5.1. Suppose $P(N > x) = x^{-\alpha} L(x)$, with $L(\cdot)$ slowly varying and $\alpha > 1$, $E[C^{\alpha+\nu}] < \infty$,
$E[Q^{\alpha+\nu}] < \infty$ for some $\nu > 0$, and let $E[N] \max\{E[C^\alpha], E[C]\} < \eta < 1$. Then, there exists a constant
$K = K(\eta, \nu) > 0$ such that for all $n \ge 1$ and all $x \ge 1$,

$$P(W_n > x) \le K \eta^n P(N > x). \qquad (5.2)$$



We would also like to point out that a bound of type (5.2) resembles a classical result by Kesten (see Lemma 7
on p. 149 of [5]) stating that the sum of heavy-tailed (subexponential) random variables satisfies

$$P(X_1 + \cdots + X_n > x) \le K (1 + \epsilon)^n P(X_1 > x),$$

uniformly for all $n$ and $x$, for any $\epsilon > 0$ (see also [14] for more recent work). The main difference between
this result and (5.2) is that while $n$ above refers to the number of terms in the sum, in (5.2) it refers to
the depth of the recursion. This makes the derivation of (5.2) considerably more complicated, and perhaps
implausible if it were not for the fact that we restrict our attention to regularly varying distributions, as
opposed to the general subexponential class.

In view of (5.2), we can now prove the main theorem of this section.



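Theorem 5.1. Suppose $P(N > x) = x^{-\alpha} L(x)$, with $L(\cdot)$ slowly varying and $\alpha > 1$. Let $\rho = E[N]E[C]$ and
$\rho_\alpha = E[N]E[C^\alpha]$. Assume $\rho \vee \rho_\alpha < 1$, and $E[C^{\alpha+\epsilon}] < \infty$, $E[Q^{\alpha+\epsilon}] < \infty$ for some $\epsilon > 0$. Then,

$$P(R > x) \sim \frac{(E[C]E[Q])^\alpha}{(1 - \rho)^\alpha (1 - \rho_\alpha)}\, P(N > x)$$

as $x \to \infty$, where $R$ was defined by (2.2).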

Remarks: (i) A related result that also allows Q and to be dependent was derived very recently in 
[34] using transform methods and Tauberian theorems under the moment conditions $E[Q] < 1$, $E[C] =
(1 - E[Q])/E[N]$. (ii) Note that this result implies the classical result on the busy period of an M/G/1
queue derived in [13]. Specifically, the total number of customers in a busy period $B$ satisfies the recursion
$B \stackrel{D}{=} 1 + \sum_{i=1}^{N(S)} B_i$, where the $B_i$'s are iid copies of $B$, $N(t)$ is a Poisson process of rate $\lambda$ and $S$ is the service
distribution; $\{B_i\}$, $N(t)$ and $S$ are mutually independent and $\rho = E[N(S)] < 1$. Now, the recursion for $B$ is
obtained from our theorem by setting $C = 1$ and $Q = 1$, implying that $P(B > x) \sim P(N(S) > x)/(1 - \rho)^{\alpha+1}$.
Next, one can obtain the asymptotics for the length of the busy period $P$ by using the identity $B = N(P)$.
This can be easily derived, in spite of the fact that $N(t)$ and $P$ are correlated, since $N(t)$ is highly concentrated
around its mean. For recent work on the power law asymptotics of the GI/GI/1 busy period see [36]. (iii)
In view of Lemma 5.1 the theorem shows that the limits $\lim_{x \to \infty} \lim_{n \to \infty} P(R^{(n)} > x)/P(N > x)$ are
interchangeable.



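Proof of Theorem 5.1. Fix $0 < \delta < 1$ and $n_0 \ge 1$. Choose $\rho \vee \rho_\alpha < \eta < 1$ and use Proposition 5.1 to
obtain that for some constant $K_0 > 0$,

$$P(W_n > x) \le K_0 \eta^n P(N > x)$$

for all $n \ge 1$ and all $x \ge 1$. Let $H_\alpha^{(n)} = (E[C]E[Q])^\alpha (1 - \rho)^{-\alpha} \sum_{k=0}^n \rho_\alpha^k (1 - \rho^{n-k})^\alpha$ and $H_\alpha = H_\alpha^{(\infty)} = (E[C]E[Q])^\alpha (1 - \rho)^{-\alpha} (1 - \rho_\alpha)^{-1}$. Then,

$$\left| P(R > x) - H_\alpha P(N > x) \right| \le \left| P(R > x) - P(R^{(n_0)} > x) \right| \qquad (5.3)$$
$$+ \left| P(R^{(n_0)} > x) - H_\alpha^{(n_0)} P(N > x) \right| \qquad (5.4)$$
$$+ \left| H_\alpha^{(n_0)} - H_\alpha \right| P(N > x). \qquad (5.5)$$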
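By Lemma 5.1 there exists a function $\varphi(x) \downarrow 0$ as $x \to \infty$ such that

$$\left| P(R^{(n_0)} > x) - H_\alpha^{(n_0)} P(N > x) \right| \le \varphi(x) H_\alpha^{(n_0)} P(N > x).$$

To bound (5.3) let $\beta = \eta^{1/(2\alpha+2)} < 1$ and note that

$$\left| P(R > x) - P(R^{(n_0)} > x) \right| \le P\left( R^{(n_0)} + (R - R^{(n_0)}) > x,\ R - R^{(n_0)} \le \delta x \right) - P(R^{(n_0)} > x) + P\left( R - R^{(n_0)} > \delta x \right)$$
$$\le P(R^{(n_0)} > (1 - \delta)x) - P(R^{(n_0)} > x) + P\left( \sum_{n=n_0+1}^\infty W_n > \delta x \right)$$
$$\le \left| P(R^{(n_0)} > (1 - \delta)x) - H_\alpha^{(n_0)} P(N > (1 - \delta)x) \right| + \left| H_\alpha^{(n_0)} P(N > x) - P(R^{(n_0)} > x) \right|$$
$$\quad + H_\alpha^{(n_0)} \left( P(N > (1 - \delta)x) - P(N > x) \right) + \sum_{n=n_0+1}^\infty P\left( W_n > \delta x (1 - \beta) \beta^{n-n_0-1} \right)$$
$$\le \left\{ 2\varphi((1 - \delta)x) \frac{P(N > (1 - \delta)x)}{P(N > x)} + \frac{P(N > (1 - \delta)x)}{P(N > x)} - 1 \right\} H_\alpha^{(n_0)} P(N > x) + K_0 \sum_{n=n_0+1}^\infty \eta^n P\left( N > \delta x (1 - \beta) \beta^{n-n_0-1} \right),$$

where in the last inequality we applied the uniform bound from Proposition 5.1. The expression in curly
brackets is bounded by

$$2\varphi((1 - \delta)x) (1 - \delta)^{-\alpha} \frac{L((1 - \delta)x)}{L(x)} + \left( (1 - \delta)^{-\alpha} \frac{L((1 - \delta)x)}{L(x)} - 1 \right) \to (1 - \delta)^{-\alpha} - 1$$

as $x \to \infty$. By Potter's Theorem (see Theorem 1.5.6 (ii) on p. 25 in [5]), there exists a constant $A = A(1) > 1$
such that

$$K_0 \sum_{n=n_0+1}^\infty \eta^n P\left( N > \delta x (1 - \beta) \beta^{n-n_0-1} \right) \le K_0 A \sum_{n=n_0+1}^\infty \eta^n \left( \delta (1 - \beta) \beta^{n-n_0-1} \right)^{-\alpha-1} P(N > x)$$
$$= K_0 A (\delta (1 - \beta))^{-\alpha-1} (1 - \eta^{1/2})^{-1} \eta^{n_0+1} P(N > x)$$
$$\le K \delta^{-\alpha-1} \eta^{n_0} P(N > x).$$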
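
The choice $\beta = \eta^{1/(2\alpha+2)}$ is what makes the series in the Potter bound collapse: since $\beta^{\alpha+1} = \eta^{1/2}$, each term $\eta^n (\beta^{n-n_0-1})^{-\alpha-1}$ equals $\eta^{n_0+1} \eta^{(n-n_0-1)/2}$, and the sum is geometric with ratio $\eta^{1/2}$. The short numerical check below illustrates this step only; the function names, the truncation level and the parameter values are assumptions made for the example.

```python
def potter_series(eta: float, alpha: float, n0: int, terms: int = 500) -> float:
    """Truncated sum over n > n0 of eta^n * (beta^(n-n0-1))^(-alpha-1),
    with beta = eta^(1/(2*alpha+2)) as chosen in the proof."""
    beta = eta ** (1.0 / (2.0 * alpha + 2.0))
    return sum(
        eta ** n * (beta ** (n - n0 - 1)) ** (-alpha - 1.0)
        for n in range(n0 + 1, n0 + 1 + terms)
    )


def potter_series_closed_form(eta: float, n0: int) -> float:
    """Closed form of the full series: eta^(n0+1) / (1 - eta^(1/2))."""
    return eta ** (n0 + 1) / (1.0 - eta ** 0.5)


numeric = potter_series(eta=0.6, alpha=1.5, n0=3)
exact = potter_series_closed_form(eta=0.6, n0=3)
```

Note also that the weights $(1-\beta)\beta^{n-n_0-1}$, $n > n_0$, sum to one, which is why the event $\{\sum_{n > n_0} W_n > \delta x\}$ can be split term by term in the preceding display.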
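Next, for (5.5) simply note that

$$\frac{\left| H_\alpha^{(n_0)} - H_\alpha \right|}{H_\alpha} = (1 - \rho_\alpha) \left( \sum_{k=0}^\infty \rho_\alpha^k - \sum_{k=0}^{n_0} \rho_\alpha^k (1 - \rho^{n_0-k})^\alpha \right)$$
$$= (1 - \rho_\alpha) \sum_{k=0}^{n_0} \rho_\alpha^k \left( 1 - (1 - \rho^{n_0-k})^\alpha \right) + (1 - \rho_\alpha) \sum_{k=n_0+1}^\infty \rho_\alpha^k$$
$$\le \left[ \alpha (1 - \rho_\alpha)(n_0 + 1) + \rho_\alpha \right] (\rho_\alpha \vee \rho)^{n_0}$$
$$\le K \eta^{n_0}.$$

Finally, by replacing the preceding estimates in (5.3) - (5.5), we obtain

$$\limsup_{x \to \infty} \left| \frac{P(R > x)}{H_\alpha P(N > x)} - 1 \right| \le (1 - \delta)^{-\alpha} - 1 + K \delta^{-\alpha-1} \eta^{n_0}.$$

Since the right hand side can be made arbitrarily small by first letting $n_0 \to \infty$ and then $\delta \downarrow 0$, the result of
the theorem follows. □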



Engineering implications. Recall that for Google's PageRank algorithm the weights are given by $C_i =
c/D_i < 1$, where $0 < c < 1$ is a constant related to the damping factor and the number of nodes in the Web
graph, and $D_i$ corresponds to the out-degree of a page. We point out that dividing the ranks of neighboring
pages by their out-degree has the purpose of decreasing the contribution of pages with highly inflated
referencing. However, Theorem 5.1 reveals that the page rank is essentially insensitive to the parameters of
the out-degree distribution, which means that PageRank basically reflects the popularity vote given by the
number of references $N$. This same observation was previously made in [34].

Furthermore, Theorem 4.1 clearly shows that the choice of weights $C_i$ in the ranking algorithm can determine
the distribution of $R$ as well. Note that for the PageRank algorithm the weights $C_i = c/D_i < 1$ can never
dominate the asymptotic behavior of $R$ when $N$ is a power law. Therefore, Theorem 4.2 suggests a potential
development of new ranking algorithms where the ranks will be much more sensitive to the weights.



6. The case when Q dominates 

This section of the paper treats the case when the heavy-tailed behavior of $R$ arises from the $\{Q_i\}$, known
in the autoregressive processes literature as innovations. The results presented here are very similar to those
in Section 5 and so are their proofs. We will therefore only present the statements of the results and skip
most of the proofs. We start with the equivalent of Lemmas 5.1 and 5.2 in this context; their proofs are
given in Section 7.4.

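Lemma 6.1. Suppose $P(Q > x) = x^{-\alpha} L(x)$ with $L(\cdot)$ slowly varying, $\alpha > 1$, and $E[N^{\alpha+\epsilon}] < \infty$, $E[C^{\alpha+\epsilon}] <
\infty$ for some $\epsilon > 0$; let $\rho_\alpha = E[N]E[C^\alpha]$. Then, for any fixed $n \in \{1, 2, 3, \ldots\}$,

$$P(R^{(n)} > x) \sim \sum_{k=0}^n \rho_\alpha^k\, P(Q > x)$$

as $x \to \infty$, where $R^{(n)}$ was defined in Section 2.1.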
As for the case when $N$ dominates the asymptotic behavior of $R$, we can here expect that

$$P(R > x) \sim (1 - \rho_\alpha)^{-1} P(Q > x),$$

and the technical difficulty is justifying the exchange of limits. The same techniques used in Section 5 can
be used in this case as well. Therefore, we give a sketch of the arguments in Section 7.4 but omit the proof.
The following is the equivalent of Lemma 5.2.

Lemma 6.2. Suppose $P(Q > x) = x^{-\alpha} L(x)$ with $L(\cdot)$ slowly varying, $\alpha > 1$, and $E[N^{\alpha+\epsilon}] < \infty$, $E[C^{\alpha+\epsilon}] <
\infty$ for some $\epsilon > 0$; let $\rho_\alpha = E[N]E[C^\alpha]$. Then, for any fixed $n \in \{1, 2, 3, \ldots\}$,

$$P(W_n > x) \sim \rho_\alpha^n\, P(Q > x)$$

as $x \to \infty$, where $W_n$ was defined in Section 2.1.

The corresponding version of Proposition 5.1 is given below.

Proposition 6.1. Suppose $P(Q > x) = x^{-\alpha} L(x)$, with $L(\cdot)$ slowly varying and $\alpha > 1$, $E[C^{\alpha+\nu}] < \infty$,
$E[N^{\alpha+\nu}] < \infty$ for some $\nu > 0$, and let $E[N] \max\{E[C^\alpha], E[C]\} < \eta < 1$. Then, there exists a constant
$K = K(\eta, \nu) > 0$ such that for all $n \ge 1$ and all $x \ge 1$,

$$P(W_n > x) \le K \eta^n P(Q > x).$$

A sketch of the proof can be found in Section 7.4.

And finally, the main theorem of this section. The proof again greatly resembles that of Theorem 5.1 and
is therefore omitted.

Theorem 6.1. Suppose $P(Q > x) = x^{-\alpha} L(x)$, with $L(\cdot)$ slowly varying and $\alpha > 1$. Let $\rho = E[N]E[C]$ and
$\rho_\alpha = E[N]E[C^\alpha]$. Assume $\rho \vee \rho_\alpha < 1$, and $E[C^{\alpha+\epsilon}] < \infty$, $E[N^{\alpha+\epsilon}] < \infty$ for some $\epsilon > 0$. Then,

$$P(R > x) \sim (1 - \rho_\alpha)^{-1} P(Q > x)$$

as $x \to \infty$, where $R$ was defined in (2.2).

Compare this result with Lemma A.3 in [28], where the autoregressive process of order one with regularly
varying innovations is shown to be tail-equivalent to $Q$. In particular, if we set $N = 1$ in Theorem 6.1 and
let $A_k = \prod_{i=1}^k C_i$, our result reduces to

$$P\left( \sum_{k=0}^\infty A_k Q_k > x \right) \sim \sum_{k=0}^\infty E[A_k^\alpha]\, P(Q > x),$$

which is in line with the commonly accepted intuition about heavy-tailed large deviations where large sums
are due to one large summand $Q_k$.

7. Proofs 

This section contains the proofs to most of the technical results presented in the paper, together with
some auxiliary lemmas that are needed along the way. The section is divided into four subsections, each
corresponding to the content of Sections 3, 4, 5 and 6, respectively.
7.1. Moments of $W_n$

Here we give the proof of the moment bound for the $\beta$-moment, $\beta > 1$, of the sum of the weights, $W_n$, of the
$n$th generation. As an intermediate step, we present a lemma for the integer moments of $W_n$, but first we
give the proof of Lemma 4.2.

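Proof of Lemma 4.2. Let $p = \lceil \beta \rceil \in \{2, 3, \ldots\}$ and $\gamma = \beta/p \in (0, 1]$. Define $A_p(k) = \{(j_1, \ldots, j_k) \in \mathbb{Z}^k :
j_1 + \cdots + j_k = p,\ 0 \le j_i < p\}$. Then,

$$\left( \sum_{i=1}^k y_i \right)^\beta = \left( \sum_{i=1}^k y_i \right)^{p\gamma} = \left( \sum_{i=1}^k y_i^p + \sum_{(j_1, \ldots, j_k) \in A_p(k)} \binom{p}{j_1, \ldots, j_k} y_1^{j_1} \cdots y_k^{j_k} \right)^\gamma$$
$$\le \sum_{i=1}^k y_i^{p\gamma} + \left( \sum_{(j_1, \ldots, j_k) \in A_p(k)} \binom{p}{j_1, \ldots, j_k} y_1^{j_1} \cdots y_k^{j_k} \right)^\gamma,$$

where for the last step we used the well known inequality $\left( \sum_{i=1}^k x_i \right)^\gamma \le \sum_{i=1}^k x_i^\gamma$ for $0 < \gamma \le 1$ and $x_i \ge 0$
(see the proof of Lemma 3.1). We now use Jensen's inequality to obtain

$$E\left[ \left( \sum_{i=1}^k Y_i \right)^\beta - \sum_{i=1}^k Y_i^\beta \right] \le E\left[ \left( \sum_{(j_1, \ldots, j_k) \in A_p(k)} \binom{p}{j_1, \ldots, j_k} Y_1^{j_1} \cdots Y_k^{j_k} \right)^\gamma \right]$$
$$\le \left( \sum_{(j_1, \ldots, j_k) \in A_p(k)} \binom{p}{j_1, \ldots, j_k} E\left[ Y_1^{j_1} \cdots Y_k^{j_k} \right] \right)^\gamma.$$

Since the $\{Y_i\}$ are iid, we have

$$E\left[ Y_1^{j_1} \cdots Y_k^{j_k} \right] = \|Y\|_{j_1}^{j_1} \cdots \|Y\|_{j_k}^{j_k},$$

where $\|Y\|_\kappa = E[|Y|^\kappa]^{1/\kappa}$ for $\kappa \ge 1$ and $\|Y\|_0 = 1$. Since $\|Y\|_\kappa$ is increasing for $\kappa \ge 1$ it follows that
$\|Y\|_{j_i}^{j_i} \le \|Y\|_{p-1}^{j_i}$. It follows that

$$\|Y\|_{j_1}^{j_1} \cdots \|Y\|_{j_k}^{j_k} \le \|Y\|_{p-1}^p,$$

which in turn implies that

$$E\left[ \left( \sum_{i=1}^k Y_i \right)^\beta - \sum_{i=1}^k Y_i^\beta \right] \le \left( \sum_{(j_1, \ldots, j_k) \in A_p(k)} \binom{p}{j_1, \ldots, j_k} \|Y\|_{p-1}^p \right)^\gamma = \|Y\|_{p-1}^\beta (k^p - k)^\gamma \le \|Y\|_{p-1}^\beta k^\beta.$$

□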
Lemma 7.1. Suppose $E[Q^p] < \infty$, $E[N^p] < \infty$, and $E[N] \max\{E[C^p], E[C]\} < 1$ for some $p \in \{2, 3, \ldots\}$.
Then, there exists a constant $K_p > 0$ such that

$$E[W_n^p] \le K_p \left( E[N] \max\{E[C], E[C^p]\} \right)^n$$

for all $n \ge 0$.

Proof. Let $Y = CW_{n-1}$, where $C$ is independent of $W_{n-1}$, and let $\{Y_i\}$ be independent copies of $Y$. We
will give an induction proof in $p$. For $p = 2$ we have

$$E[W_n^2] = E\left[ \left( \sum_{i=1}^N Y_i \right)^2 \right] = E[N]E[Y^2] + E[N(N-1)](E[Y])^2 = E[N]E[C^2]E[W_{n-1}^2] + E[N(N-1)](E[C]E[W_{n-1}])^2.$$

Using the preceding recursion, letting $\rho = E[N]E[C]$, $\rho_2 = E[N]E[C^2]$, and noting that

$$E[W_{n-1}] = \rho^{n-1} E[Q],$$

we obtain

$$E[W_n^2] = \rho_2 E[W_{n-1}^2] + K \rho^{2(n-1)}, \qquad (7.1)$$

where $K = E[N(N-1)](E[C]E[Q])^2$. Now, iterating (7.1) gives

$$E[W_n^2] = \rho_2 \left( \rho_2 E[W_{n-2}^2] + K \rho^{2(n-2)} \right) + K \rho^{2(n-1)}$$
$$= \rho_2^{n-1} \left( \rho_2 E[W_0^2] + K \right) + K \sum_{i=0}^{n-2} \rho_2^i \rho^{2(n-1-i)}$$
$$= \rho_2^n E[Q^2] + K \sum_{i=0}^{n-1} \rho_2^i \rho^{2(n-1-i)}$$
$$\le (\rho_2 \vee \rho)^n E[Q^2] + K (\rho_2 \vee \rho)^n \sum_{i=0}^{n-1} (\rho_2 \vee \rho)^{n-2-i}$$
$$\le \left( E[Q^2] + \frac{K}{\rho_2 \vee \rho} \sum_{j=0}^\infty (\rho_2 \vee \rho)^j \right) (\rho_2 \vee \rho)^n$$
$$= K_2 (\rho_2 \vee \rho)^n.$$
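
The second-moment recursion $E[W_n^2] = \rho_2 E[W_{n-1}^2] + K\rho^{2(n-1)}$ with $E[W_0^2] = E[Q^2]$ is simple enough to iterate numerically and compare with its closed-form solution and with the geometric bound $K_2(\rho_2 \vee \rho)^n$. The sketch below does so for illustrative values of $\rho$, $\rho_2$, $K$ and $E[Q^2]$; all names and numbers are assumptions made for this example rather than quantities from the paper.

```python
def second_moment_by_recursion(n: int, rho: float, rho2: float,
                               K: float, EQ2: float) -> float:
    """Iterate E[W_j^2] = rho2 * E[W_{j-1}^2] + K * rho^(2(j-1)), starting at E[Q^2]."""
    m2 = EQ2
    for j in range(1, n + 1):
        m2 = rho2 * m2 + K * rho ** (2 * (j - 1))
    return m2


def second_moment_closed_form(n: int, rho: float, rho2: float,
                              K: float, EQ2: float) -> float:
    """rho2^n E[Q^2] + K * sum_{i=0}^{n-1} rho2^i rho^(2(n-1-i))."""
    return rho2 ** n * EQ2 + K * sum(
        rho2 ** i * rho ** (2 * (n - 1 - i)) for i in range(n)
    )


def second_moment_bound(n: int, rho: float, rho2: float,
                        K: float, EQ2: float) -> float:
    """Geometric bound K_2 * r^n with r = max(rho, rho2) < 1 and
    K_2 = E[Q^2] + K / (r (1 - r)), i.e. the geometric series summed explicitly."""
    r = max(rho, rho2)
    K2 = EQ2 + K / (r * (1.0 - r))
    return K2 * r ** n


args = dict(rho=0.5, rho2=0.3, K=2.0, EQ2=1.5)
table = [
    (second_moment_by_recursion(n, **args),
     second_moment_closed_form(n, **args),
     second_moment_bound(n, **args))
    for n in range(0, 30)
]
```

The bound is far from tight for small $n$, which is harmless here because only the geometric rate $(\rho_2 \vee \rho)^n$ is used in the induction that follows.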



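Next, for any $p \in \{2, 3, \ldots\}$ let $\rho_p = E[N]E[C^p]$. Suppose now that there exists a constant $K_{p-1} > 0$ such
that

$$E[W_n^{p-1}] \le K_{p-1} (\rho_{p-1} \vee \rho)^n \qquad (7.2)$$

for all $n \ge 0$. By Lemma 4.2 we have

$$E[W_n^p] = \sum_{k=1}^\infty E\left[ \left( \sum_{i=1}^k Y_i \right)^p \right] P(N = k)$$
$$\le \sum_{k=1}^\infty \left( k E[Y^p] + k^p \left( E[Y^{p-1}] \right)^{p/(p-1)} \right) P(N = k)$$
$$= E[N]E[C^p]E[W_{n-1}^p] + E[N^p] \left( E[C^{p-1}] \right)^{p/(p-1)} \left( E[W_{n-1}^{p-1}] \right)^{p/(p-1)}$$
$$\le \rho_p E[W_{n-1}^p] + E[N^p] \left( E[C^{p-1}] \right)^{p/(p-1)} (K_{p-1})^{p/(p-1)} (\rho_{p-1} \vee \rho)^{(n-1)p/(p-1)},$$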
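where the last inequality corresponds to the induction hypothesis. We then obtain the recursion

$$E[W_n^p] \le \rho_p E[W_{n-1}^p] + K (\rho_{p-1} \vee \rho)^{(n-1)p/(p-1)}, \qquad (7.3)$$

where $K = E[N^p] \left( E[C^{p-1}] \right)^{p/(p-1)} (K_{p-1})^{p/(p-1)}$. Iterating (7.3) as for the case $p = 2$ gives

$$E[W_n^p] \le \rho_p^n E[Q^p] + K \sum_{i=0}^{n-1} \rho_p^i (\rho_{p-1} \vee \rho)^{(n-1-i)p/(p-1)}$$
$$\le (\rho_p \vee \rho)^n E[Q^p] + K \sum_{i=0}^{n-1} (\rho_p \vee \rho)^{i + (n-1-i)p/(p-1)}$$
$$= (\rho_p \vee \rho)^n E[Q^p] + K (\rho_p \vee \rho)^{n-1} \sum_{i=0}^{n-1} (\rho_p \vee \rho)^{(n-1-i)/(p-1)}$$
$$\le \left( E[Q^p] + K (\rho_p \vee \rho)^{-1} \sum_{j=0}^\infty (\rho_p \vee \rho)^{j/(p-1)} \right) (\rho_p \vee \rho)^n$$
$$= K_p (\rho_p \vee \rho)^n.$$

□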



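The proof for the general $\beta$-moment, $\beta > 1$, is given below.

Proof of Lemma 3.3. Set $p = \lceil \beta \rceil \ge \beta > 1$. Since the result when $p = \beta$ follows from Lemma 7.1 we
assume that $p > \beta$. Let $Y = CW_{n-1}$, where $C$ is independent of $W_{n-1}$ and $\{Y_i\}$ are independent copies of
$Y$. Also, recall that $\rho = E[N]E[C]$ and $\rho_\beta = E[N]E[C^\beta]$. Then, by Lemma 4.2,

$$E[W_n^\beta] = E\left[ \left( \sum_{i=1}^N Y_i \right)^\beta \right] = \sum_{k=1}^\infty E\left[ \left( \sum_{i=1}^k Y_i \right)^\beta \right] P(N = k)$$
$$= \sum_{k=1}^\infty \left( E\left[ \left( \sum_{i=1}^k Y_i \right)^\beta - \sum_{i=1}^k Y_i^\beta \right] + E\left[ \sum_{i=1}^k Y_i^\beta \right] \right) P(N = k)$$
$$\le \sum_{k=1}^\infty \left( k^\beta \left( E[Y^{p-1}] \right)^{\beta/(p-1)} + k E[Y^\beta] \right) P(N = k)$$
$$= E[N^\beta] \left( E[C^{p-1}] \right)^{\beta/(p-1)} \left( E[W_{n-1}^{p-1}] \right)^{\beta/(p-1)} + \rho_\beta E[W_{n-1}^\beta].$$

Then, by Lemma 7.1,

$$E[W_n^\beta] \le \rho_\beta E[W_{n-1}^\beta] + E[N^\beta] \left( E[C^{p-1}] \right)^{\gamma} \left( K_{p-1} (\rho_{p-1} \vee \rho)^{n-1} \right)^{\gamma} = \rho_\beta E[W_{n-1}^\beta] + K (\rho_{p-1} \vee \rho)^{(n-1)\gamma},$$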
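where $\gamma = \beta/(p - 1) > 1$. Finally, iterating the preceding bound $n - 1$ times gives

$$E[W_n^\beta] \le \rho_\beta^n E[W_0^\beta] + K \sum_{i=0}^{n-1} \rho_\beta^i (\rho \vee \rho_{p-1})^{(n-1-i)\gamma}$$
$$\le E[W_0^\beta] (\rho \vee \rho_\beta)^n + K \sum_{i=0}^{n-1} (\rho \vee \rho_\beta)^{i + (n-1-i)\gamma}$$
$$= E[Q^\beta] (\rho \vee \rho_\beta)^n + K (\rho \vee \rho_\beta)^{n-1} \sum_{i=0}^{n-1} (\rho \vee \rho_\beta)^{(\gamma-1)i}$$
$$\le K_\beta (\rho \vee \rho_\beta)^n.$$

This completes the proof. □
7.2. The case when the C's dominate: Implicit renewal theory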
7.2. The case when the C's dominate: Implicit renewal theory 

In this section we state a lemma that is used in the proof of Theorem 4.1 and we give the proofs of Theorem 4.1
and Lemma 4.3.



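Lemma 7.2. Let $\alpha, \beta > 0$ and $H \ge 0$. Suppose $\int_0^t v^{\alpha+\beta-1} P(R > v)\, dv \sim H t^\beta/\beta$ as $t \to \infty$. Then,

$$P(R > t) \sim H t^{-\alpha}, \qquad t \to \infty.$$

Proof. This lemma is a special case of the Monotone Density Theorem, see Theorem 1.7.5 (also Exercise 1.11.14)
in (§]. However, for completeness, we give a direct proof here, similar to the one of Lemma 9.3
in [16]. By assumption, for any $b > 1$, $\epsilon \in (0, 1)$, and $t$ sufficiently large,

$$P(R > t)\, t^{\alpha+\beta} \cdot \frac{b^{\alpha+\beta} - 1}{\alpha + \beta} \ge \int_t^{bt} v^{\alpha+\beta-1} P(R > v)\, dv \ge \frac{(H - \epsilon)(bt)^\beta}{\beta} - \frac{(H + \epsilon) t^\beta}{\beta} = \frac{t^\beta}{\beta} \left( H(b^\beta - 1) - \epsilon (1 + b^\beta) \right).$$

Since $\epsilon$ was arbitrary, we can take the limit as $\epsilon \to 0$ and obtain

$$\liminf_{t \to \infty} P(R > t)\, t^\alpha \ge H\, \frac{b^\beta - 1}{\beta} \cdot \frac{\alpha + \beta}{b^{\alpha+\beta} - 1} \to H, \qquad b \downarrow 1.$$

Similarly, one can prove that $\limsup_{t \to \infty} P(R > t)\, t^\alpha \le H$ starting from $\int_{bt}^t v^{\alpha+\beta-1} P(R > v)\, dv$ with
$0 < b < 1$. □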

Proof of Theorem 4.1. For any $k \in \mathbb{N}$ define $\Pi_k = \prod_{i=1}^k C_i$ and $V_k = \sum_{i=1}^k \log C_i$ with $\Pi_0 = 1$ and
$V_0 = 0$, where the $C_i$'s are independent copies of $C$. Then, for any $t \in \mathbb{R}$,

$$P(R > e^t) = \sum_{k=1}^n \left( m^{k-1} P(\Pi_{k-1} R > e^t) - m^k P(\Pi_k R > e^t) \right) + m^n P(\Pi_n R > e^t)$$
$$= \sum_{k=1}^n \left( m^{k-1} P(e^{V_{k-1}} R > e^t) - m^k P(e^{V_{k-1}} C_k R > e^t) \right) + m^n P(e^{V_n} R > e^t)$$
$$= \sum_{k=0}^{n-1} m^k \int_{-\infty}^\infty \left( P(R > e^{t-v}) - m P(CR > e^{t-v}) \right) P(V_k \in dv) + m^n P(e^{V_n} R > e^t).$$
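Next, define

$$\nu_n(dt) = \sum_{k=0}^n e^{\alpha t} m^k P(V_k \in dt), \qquad g(t) = e^{\alpha t} \left( P(R > e^t) - m P(CR > e^t) \right),$$

$$r(t) = e^{\alpha t} P(R > e^t) \quad \text{and} \quad \delta_n(t) = e^{\alpha t} m^n P(e^{V_n} R > e^t).$$

Then, for any $t \in \mathbb{R}$ and $n \in \mathbb{N}$,

$$r(t) = (g * \nu_{n-1})(t) + \delta_n(t).$$

Next, for any $\beta > 0$, define the smoothing operator

$$\breve{f}(t) = \int_{-\infty}^t e^{-\beta(t-u)} f(u)\, du$$

and note that

$$\breve{r}(t) = \int_{-\infty}^t e^{-\beta(t-u)} (g * \nu_{n-1})(u)\, du + \breve{\delta}_n(t)$$
$$= \int_{-\infty}^t e^{-\beta(t-u)} \int_{-\infty}^\infty g(u - v)\, \nu_{n-1}(dv)\, du + \breve{\delta}_n(t)$$
$$= \int_{-\infty}^\infty \int_{-\infty}^t e^{-\beta(t-u)} g(u - v)\, du\, \nu_{n-1}(dv) + \breve{\delta}_n(t)$$
$$= \int_{-\infty}^\infty \breve{g}(t - v)\, \nu_{n-1}(dv) + \breve{\delta}_n(t)$$
$$= (\breve{g} * \nu_{n-1})(t) + \breve{\delta}_n(t). \qquad (7.4)$$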



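Next, we will show that one can pass $n \to \infty$ in the preceding identity. To this end, let $\eta(du) =
e^{\alpha u} m P(\log C \in du)$, and note that this measure places no mass at $-\infty$. Also, by assumption, $\eta(\cdot)$ is a
nonarithmetic measure on $\mathbb{R}$. Moreover,

$$\int_{-\infty}^\infty \eta(du) = m E[e^{\alpha \log C}] = m E[C^\alpha] = 1$$

and

$$\int_{-\infty}^\infty u\, \eta(du) = m E[e^{\alpha \log C} \log C] = m E[C^\alpha \log C] \equiv m\mu$$

imply that $\eta(\cdot)$ is a probability measure with mean $0 < m\mu < \infty$. Furthermore,

$$\nu(dt) = \sum_{k=0}^\infty m^k e^{\alpha t} P(V_k \in dt)$$

is its renewal measure since $\nu(dt) = \sum_{n=0}^\infty \eta^{*n}(dt)$. Since $m\mu > 0$, then $(|f| * \nu)(t) < \infty$ for all $t$ whenever
$f$ is directly Riemann integrable. From (4.3) we know that $g \in L^1$, so by Lemma 9.2 from [16], $\breve{g}$ is directly
Riemann integrable, resulting in $(|\breve{g}| * \nu)(t) < \infty$ for all $t$. Thus $(|\breve{g}| * \nu)(t) = E\left[ \sum_{k=0}^\infty m^k e^{\alpha V_k} |\breve{g}(t - V_k)| \right] <
\infty$. By Fubini's Theorem, $E\left[ \sum_{k=0}^\infty m^k e^{\alpha V_k} \breve{g}(t - V_k) \right]$ exists and

$$(\breve{g} * \nu)(t) = E\left[ \sum_{k=0}^\infty m^k e^{\alpha V_k} \breve{g}(t - V_k) \right] = \sum_{k=0}^\infty E\left[ m^k e^{\alpha V_k} \breve{g}(t - V_k) \right] = \lim_{n \to \infty} (\breve{g} * \nu_n)(t).$$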
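Now, by assumption, we can choose $\beta$ in the definition of the smoothing operator such that $0 < \beta < \alpha$ and $m E[C^\beta] < 1$. We show below that for such $\beta$ we have $\breve{\delta}_n(t) \to 0$ as $n \to \infty$ for all fixed $t$, since

$$\begin{aligned}
\breve{\delta}_n(t) &= \int_{-\infty}^t e^{-\beta(t-u)} e^{\alpha u} m^n P(e^{\beta V_n} R^\beta > e^{\beta u})\, du \\
&= \frac{e^{-\beta t} m^n}{\beta} \int_0^{e^{\beta t}} v^{\alpha/\beta} P(e^{\beta V_n} R^\beta > v)\, dv \\
&\le \frac{e^{(\alpha-\beta) t}}{\beta}\, E[R^\beta]\, (m E[C^\beta])^n \to 0
\end{aligned}$$

as $n \to \infty$. Hence, the preceding arguments allow us to pass $n \to \infty$ in (7.4), and obtain

$$\breve{r}(t) = (\breve{g} * \nu)(t).$$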

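Now, by the key renewal theorem for two-sided random walks in [5],

$$e^{-\beta t} \int_0^{e^t} v^{\alpha+\beta-1} P(R > v)\, dv = \breve{r}(t) \to \frac{1}{m\mu} \int_{-\infty}^\infty \breve{g}(u)\, du \triangleq \frac{H}{\beta}, \qquad t \to \infty.$$

Clearly, $H \ge 0$ since the left-hand side of the preceding equation is positive, and thus, by Lemma 7.2,

$$P(R > t) \sim H t^{-\alpha}, \qquad t \to \infty.$$

Finally,

$$\begin{aligned}
H &= \frac{\beta}{m\mu} \int_{-\infty}^\infty \int_{-\infty}^u e^{-\beta(u-t)} g(t)\, dt\, du \\
&= \frac{1}{m\mu} \int_{-\infty}^\infty g(t)\, dt \\
&= \frac{1}{m\mu} \int_0^\infty v^{\alpha-1} \left( P(R > v) - m P(CR > v) \right) dv.
\end{aligned}$$

□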

We end this section with the proof of Lemma 4.3.

Proof of Lemma 4.3. That the integral is positive follows from the union bound. That

$$\int_0^\infty \left( E[N] P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) t^{\alpha-1}\, dt = \frac{1}{\alpha} E\left[ \sum_{i=1}^N (C_i R_i)^\alpha - \left( \max_{1 \le i \le N} C_i R_i \right)^\alpha \right]$$

follows from similar arguments to those used to derive the alternative expression for $H$ in the proof of Theorem 4.2. The rest of the proof shows that the integral is finite.

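Clearly

$$\int_0^1 \left( E[N] P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) t^{\alpha-1}\, dt \le E[N] \int_0^1 t^{\alpha-1}\, dt < \infty.$$

Hence, it remains to prove that the remaining part of the integral ($\int_1^\infty \cdots\, dt$) is finite. To do this, we start by letting $Y = CR$, $F(y) = P(Y \le y)$ and $\bar{F}(y) = 1 - F(y)$. Then

$$\begin{aligned}
E[N] P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) &= \sum_{k=1}^\infty \left( F(t)^k - 1 + k \bar{F}(t) \right) P(N = k) \\
&= E\left[ (1 - \bar{F}(t))^N - 1 + N \bar{F}(t) \right].
\end{aligned}$$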
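Use the inequality $1 - x \le e^{-x}$ for $x \ge 0$ to obtain

$$E\left[ (1 - \bar{F}(t))^N - 1 + N \bar{F}(t) \right] \le E\left[ e^{-\bar{F}(t) N} - 1 + N \bar{F}(t) \right].$$

Choose $0 < \delta < \alpha\epsilon/(1+\epsilon)$ (recall that $0 < \epsilon < 1$) and let $\beta = \alpha - \delta$. By Markov's inequality and the finiteness of $E[R^\beta]$ (recall that $\beta < \alpha$),

$$\bar{F}(t) \le t^{-\beta} E[Y^\beta] = t^{-\beta} E[R^\beta] E[C^\beta] \triangleq c\, t^{-\beta} < \infty$$

for any $t > 0$. Note that the function $h(x) = e^{-x} - 1 + x$ is increasing on $[0, \infty)$, so $h(N \bar{F}(t)) \le h(cN t^{-\beta})$. Thus, by Fubini's Theorem (the integrand is nonnegative),

$$\int_1^\infty \left( E[N] P(CR > t) - P\left( \max_{1 \le i \le N} C_i R_i > t \right) \right) t^{\alpha-1}\, dt \le E\left[ \int_1^\infty \left( e^{-cN t^{-\beta}} - 1 + cN t^{-\beta} \right) t^{\alpha-1}\, dt \right].$$

Using the change of variables $u = cN t^{-\beta}$ gives

$$\begin{aligned}
E\left[ \int_1^\infty \left( e^{-cN t^{-\beta}} - 1 + cN t^{-\beta} \right) t^{\alpha-1}\, dt \right] &= \frac{1}{\beta} E\left[ (cN)^{\alpha/\beta} \int_0^{cN} (e^{-u} - 1 + u)\, u^{-\alpha/\beta - 1}\, du \right] \\
&\le \frac{E[(cN)^{\alpha/\beta}]}{\beta} \int_0^\infty (e^{-u} - 1 + u)\, u^{-\alpha/\beta - 1}\, du.
\end{aligned}$$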



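Our choice of $\beta = \alpha - \delta$ guarantees that $1 < \alpha/\beta < 1 + \epsilon$, so $E[(cN)^{\alpha/\beta}] < \infty$. It only remains to show that the last (non-random) integral is finite. To see this note that $e^{-x} - 1 + x \le x^2/2$ and $e^{-x} - 1 \le 0$ for any $x \ge 0$, so

$$\begin{aligned}
\int_0^\infty (e^{-u} - 1 + u)\, u^{-\alpha/\beta - 1}\, du &\le \frac{1}{2} \int_0^1 u^{1 - \alpha/\beta}\, du + \int_1^\infty u^{-\alpha/\beta}\, du \\
&= \frac{1}{2(2 - \alpha/\beta)} + \frac{1}{\alpha/\beta - 1} < \infty.
\end{aligned}$$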

This completes the proof. □ 
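The finiteness of the last integral can also be checked numerically. The sketch below relies on the standard Cauchy-Saalschütz representation of the gamma function, $\Gamma(-a) = \int_0^\infty (e^{-u} - 1 + u)\, u^{-a-1}\, du$ for $1 < a < 2$, which is a classical identity and not part of the paper's argument, and compares it with the bound $1/(2(2-a)) + 1/(a-1)$ for $a = \alpha/\beta$. The function names are illustrative.

```python
import math


def exact_integral(a: float) -> float:
    """Value of int_0^inf (e^{-u} - 1 + u) u^{-a-1} du for 1 < a < 2, i.e. Gamma(-a)."""
    if not 1.0 < a < 2.0:
        raise ValueError("a must lie strictly between 1 and 2")
    return math.gamma(-a)


def proof_bound(a: float) -> float:
    """Upper bound obtained by splitting the integral at u = 1."""
    return 1.0 / (2.0 * (2.0 - a)) + 1.0 / (a - 1.0)


if __name__ == "__main__":
    for a in (1.05, 1.25, 1.5, 1.75, 1.95):
        print(a, exact_integral(a), proof_bound(a))
```

Both quantities blow up as $a \downarrow 1$ or $a \uparrow 2$, which is consistent with the requirement $1 < \alpha/\beta < 1 + \epsilon < 2$ in the proof.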
7.3. The case when N dominates

This section contains the proofs of Lemma 5.1 and Proposition 5.1; the proof of Lemma 5.2 is omitted since it is basically the same as that of Lemma 5.1. We also present in Lemma 7.3 a result for sums of iid truncated random variables that may be of independent interest in the context of heavy-tailed asymptotics, since it provides bounds that do not depend on the distribution of the summands. Most of the work involved in the proof of Proposition 5.1 goes into obtaining a bound for one iteration of the recursion satisfied by $W_n$, and for the convenience of the reader it is presented separately in Lemma 7.4.

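Proof of Lemma 5.1. We proceed by induction in $n$. For $n = 1$ fix $\alpha/(\alpha+\epsilon) < \delta < 1$ and note that

$$\begin{aligned}
P(R^{(1)} > x) &= P\left( \sum_{i=1}^{N^{(0)}} C_i^{(1)} R_i^{(0)} + Q^{(0)} > x \right) \\
&\le P\left( \sum_{i=1}^N C_i Q_i > x - Q,\ Q \le x^\delta \right) + P(Q > x^\delta) \\
&\le P\left( \sum_{i=1}^N C_i Q_i > x - x^\delta \right) + O\left( x^{-\delta(\alpha+\epsilon)} \right) \\
&\sim P(N > x/E[CQ]) + o(P(N > x)) \\
&\sim (E[C] E[Q])^\alpha P(N > x),
\end{aligned}$$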
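where $N$, $\{C_i\}$, and $Q$ are independent and the fourth step is justified by Lemma 3.7(2) from [21]. Now suppose that we have

$$P(R^{(n)} > x) \sim \frac{(E[C] E[Q])^\alpha}{(1-\rho)^\alpha} \sum_{k=0}^n \rho_\alpha^k (1 - \rho^{n-k})^\alpha P(N > x).$$

Note that since $E[C^{\alpha+\epsilon}] < \infty$, then by Lemma 4.2 from [21], for $C$ independent of $R^{(n)}$,

$$P(C R^{(n)} > x) \sim E[C^\alpha] P(R^{(n)} > x).$$

Let $c^{-1} = E[C^\alpha] (E[C] E[Q])^\alpha (1-\rho)^{-\alpha} \sum_{k=0}^n \rho_\alpha^k (1 - \rho^{n-k})^\alpha$, then

$$P(N > x) \sim c\, P(C R^{(n)} > x),$$

and by Lemma 3.7(5) from [21] we have

$$\begin{aligned}
P(R^{(n+1)} > x) &= P\left( \sum_{i=1}^{N^{(0)}} C_i^{(1)} R_i^{(n)} + Q^{(0)} > x \right) \\
&\sim P\left( \sum_{i=1}^{N} C_i R_i^{(n)} > x \right) \\
&\sim \left( E[N] + c\, (E[C R^{(n)}])^\alpha \right) P(C R^{(n)} > x) \\
&\sim \left( E[N] + c\, (E[C R^{(n)}])^\alpha \right) c^{-1} P(N > x).
\end{aligned}$$

Next, observing that $E[R^{(n)}] = \sum_{i=0}^n E[W_i] = E[Q] \sum_{i=0}^n \rho^i = E[Q] (1 - \rho^{n+1})/(1 - \rho)$, we obtain

$$\begin{aligned}
\left( E[N] + c\, (E[C R^{(n)}])^\alpha \right) c^{-1} &= E[N]\, c^{-1} + (E[C] E[R^{(n)}])^\alpha \\
&= \frac{(E[C] E[Q])^\alpha}{(1-\rho)^\alpha} \left( \rho_\alpha \sum_{k=0}^n \rho_\alpha^k (1 - \rho^{n-k})^\alpha + (1 - \rho^{n+1})^\alpha \right) \\
&= \frac{(E[C] E[Q])^\alpha}{(1-\rho)^\alpha} \sum_{k=0}^{n+1} \rho_\alpha^k (1 - \rho^{n+1-k})^\alpha.
\end{aligned}$$

This completes the proof. □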
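The algebra behind the induction step can be verified numerically. The sketch below assumes the sum in the lemma has the form $S_n = \sum_{k=0}^n \rho_\alpha^k (1 - \rho^{n-k})^\alpha$ and checks the recursion $S_{n+1} = \rho_\alpha S_n + (1 - \rho^{n+1})^\alpha$; the function names and parameter values are hypothetical.

```python
def tail_sum(n: int, rho: float, rho_alpha: float, alpha: float) -> float:
    """S_n = sum_{k=0}^n rho_alpha^k (1 - rho^(n-k))^alpha."""
    return sum(rho_alpha ** k * (1.0 - rho ** (n - k)) ** alpha for k in range(n + 1))


def induction_step(n: int, rho: float, rho_alpha: float, alpha: float) -> float:
    """Right-hand side of the recursion: rho_alpha * S_n + (1 - rho^(n+1))^alpha."""
    return rho_alpha * tail_sum(n, rho, rho_alpha, alpha) + (1.0 - rho ** (n + 1)) ** alpha


if __name__ == "__main__":
    rho, rho_alpha, alpha = 0.5, 0.3, 1.7  # illustrative values with rho, rho_alpha < 1
    for n in range(1, 6):
        print(n, induction_step(n, rho, rho_alpha, alpha), tail_sum(n + 1, rho, rho_alpha, alpha))
```

For $n = 1$ the sum reduces to $(1-\rho)^\alpha$, which recovers the constant $(E[C]E[Q])^\alpha$ of the base case.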

Lemma 7.3 below relies on traditional heavy-tailed techniques based on Chernoff's inequality for truncated random variables, such as those used in [29] and [5], to name some references. The reason why we cannot simply use existing results is our need to guarantee that the bounds do not depend on the distribution of the summands, which will be key when we apply them to $W_n$. Hence special care goes into accounting for the constants explicitly. The corollary that we obtain from this lemma will be used in the proof of Lemma 7.4.



Lemma 7.3. Suppose that $Y_1, Y_2, \ldots$ are nonnegative iid random variables with the same distribution as $Y$, where $E[Y^\beta] < \infty$ for some $\beta > 0$. Fix $0 < \epsilon < 1$. Then,



1. for < (3 < 1, 1 < k < x^/E\Y% and x > e {Ke ^ 



P ( Y Y > x, max Y t < x/logx) < e -(i-«0«**)(i<*i«**)(i-Ti: 



log(eK) 
2. for $\beta > 1$, $1 \le k \le (1 - \epsilon) x/(E[Y] \vee E[Y^\beta])$, and $x \ge e \vee (Ke/\epsilon)^{2/(\beta-1)}$,

P I Yi > x, max Yi < xj log x J < e 



= - e (/3-l)(logx) 2 (l-l£fi^-^S(i££gL) +e 5 (( 3_ 1) 2 



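where $K = K(\beta) > 1$ is a constant that does not depend on $\epsilon$, $k$ or the distribution of $Y$.

Proof. Let $F(t) = P(Y \le t)$, set $y = x/\log x$ and note that

$$P\left( \sum_{i=1}^k Y_i > x,\ \max_{1 \le i \le k} Y_i \le y \right) = P\left( \sum_{i=1}^k Y_i^{(y)} > x \right) F(y)^k,$$

where $P(Y^{(y)} \le t) = F(t \wedge y)/F(y)$. Fix $\theta \ge 1/y$ and use the standard Chernoff's bound method for truncated heavy tailed sums (see, e.g. [29], [9]) to obtain

$$P\left( \sum_{i=1}^k Y_i^{(y)} > x \right) \le e^{-\theta x} E\left[ e^{\theta Y^{(y)}} \right]^k = e^{-\theta x} E\left[ e^{\theta Y} 1(Y \le y) \right]^k F(y)^{-k}.$$

From where it follows that

$$P\left( \sum_{i=1}^k Y_i^{(y)} > x \right) F(y)^k \le e^{-\theta x} E\left[ e^{\theta Y} 1(Y \le y) \right]^k.$$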
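To illustrate the truncated Chernoff bound just derived, the following sketch compares, for a small finite-valued distribution, the exact probability $P(\sum_{i \le k} Y_i > x,\ \max_i Y_i \le y)$ computed by enumeration with the bound $e^{-\theta x} E[e^{\theta Y} 1(Y \le y)]^k$. The distribution, the function names and the values of $k$, $x$, $y$ and $\theta$ are illustrative assumptions; the inequality itself holds for every $\theta > 0$, while the proof restricts attention to $\theta \ge 1/y$.

```python
import itertools
import math


def exact_truncated_tail(values, probs, k, x, y):
    """Exact P(Y_1 + ... + Y_k > x, max_i Y_i <= y) for a finite-valued Y, by enumeration."""
    total = 0.0
    for combo in itertools.product(range(len(values)), repeat=k):
        ys = [values[i] for i in combo]
        if sum(ys) > x and max(ys) <= y:
            total += math.prod(probs[i] for i in combo)
    return total


def chernoff_bound(values, probs, k, x, y, theta):
    """Upper bound e^{-theta x} * E[e^{theta Y} 1(Y <= y)]^k."""
    truncated_mgf = sum(p * math.exp(theta * v) for v, p in zip(values, probs) if v <= y)
    return math.exp(-theta * x) * truncated_mgf ** k


if __name__ == "__main__":
    values, probs = [0, 1, 2, 3, 10], [0.2] * 5  # hypothetical distribution with one large atom
    exact = exact_truncated_tail(values, probs, k=4, x=8, y=3)
    for theta in (0.1, 0.5, 1.0, 2.0):
        print(theta, exact, chernoff_bound(values, probs, 4, 8, 3, theta))
```

The bound is loose for any single $\theta$, which is why the remainder of the proof optimizes over $\theta$.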

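To analyze the preceding truncated exponential moment suppose first that $\beta > 1$. Then, by using the identity (with $\bar{F}(t) = 1 - F(t)$)

$$E[Y^\eta] = \int_0^\infty \eta\, t^{\eta-1} \bar{F}(t)\, dt \tag{7.5}$$

we obtain

$$\begin{aligned}
E\left[ e^{\theta Y} 1(Y \le y) \right] &= 1 - e^{\theta y} \bar{F}(y) + \theta \int_0^y e^{\theta t} \bar{F}(t)\, dt \\
&\le 1 + \theta \int_0^{1/\theta} \bar{F}(t)\, dt + \theta \int_0^{1/\theta} (e^{\theta t} - 1) \bar{F}(t)\, dt + \theta \int_{1/\theta}^y e^{\theta t} \bar{F}(t)\, dt \\
&\le 1 + \theta E[Y] + e \theta^2 \int_0^{1/\theta} t \bar{F}(t)\, dt + \theta \int_{1/\theta}^y e^{\theta t} \bar{F}(t)\, dt \\
&\le 1 + \theta E[Y] + \frac{e\, \theta^{2 \wedge \beta}}{2 \wedge \beta} E[Y^{2 \wedge \beta}] + \theta \int_{1/\theta}^y e^{\theta t} \bar{F}(t)\, dt,
\end{aligned} \tag{7.6}$$

where in the second inequality we use $e^x - 1 \le x e^x$, $x \ge 0$, and in the last inequality we use $t^{2 - (2 \wedge \beta)} \le \theta^{-2 + (2 \wedge \beta)}$ for $t \le 1/\theta$ and (7.5) with $\eta = 2 \wedge \beta$. Similarly, if $0 < \beta \le 1$, then

$$\begin{aligned}
E\left[ e^{\theta Y} 1(Y \le y) \right] &\le 1 + \theta \int_0^{1/\theta} e^{\theta t} \bar{F}(t)\, dt + \theta \int_{1/\theta}^y e^{\theta t} \bar{F}(t)\, dt \\
&\le 1 + \frac{e\, \theta^\beta}{\beta} E[Y^\beta] + \theta \int_{1/\theta}^y e^{\theta t} \bar{F}(t)\, dt.
\end{aligned} \tag{7.7}$$

Next, by Markov's inequality we have

$$\bar{F}(t) \le E[Y^\beta]\, t^{-\beta},$$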
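which, in combination with (7.6) and (7.7), gives

$$E\left[ e^{\theta Y} 1(Y \le y) \right] \le \begin{cases}
1 + \theta E[Y] + \dfrac{e \theta^2}{2} E[Y^2] + E[Y^\beta]\, \theta \displaystyle\int_{1/\theta}^y e^{\theta t} t^{-\beta}\, dt, & \beta > 2, \\[2ex]
1 + \theta E[Y] + \dfrac{e \theta^\beta}{\beta} E[Y^\beta] + E[Y^\beta]\, \theta \displaystyle\int_{1/\theta}^y e^{\theta t} t^{-\beta}\, dt, & 1 < \beta \le 2, \\[2ex]
1 + \dfrac{e \theta^\beta}{\beta} E[Y^\beta] + E[Y^\beta]\, \theta \displaystyle\int_{1/\theta}^y e^{\theta t} t^{-\beta}\, dt, & 0 < \beta \le 1.
\end{cases} \tag{7.8}$$

To analyze the remaining integral we split it as follows, for $\beta > 0$,

$$\begin{aligned}
\theta \int_{1/\theta}^y e^{\theta t} t^{-\beta}\, dt &\le \theta \int_{1/\theta}^{y/2} e^{\theta t} t^{-\beta}\, dt + \theta \int_{y/2}^y e^{\theta t} t^{-\beta}\, dt \\
&\le \theta^\beta e^{\theta y/2} + \theta y^{1-\beta} 2^\beta \int_{1/2}^1 e^{\theta y u}\, du \\
&\le \theta^\beta e^{\theta y/2} + 2^\beta e^{\theta y} y^{-\beta},
\end{aligned}$$

from where it follows that

$$\begin{aligned}
2e \theta^\beta E[Y^\beta] + E[Y^\beta]\, \theta \int_{1/\theta}^y e^{\theta t} t^{-\beta}\, dt &\le 2e \theta^\beta E[Y^\beta] + E[Y^\beta] \theta^\beta e^{\theta y/2} + E[Y^\beta] 2^\beta e^{\theta y} y^{-\beta} \\
&\le 2^\beta E[Y^\beta] e^{\theta y} y^{-\beta} \left( 1 + e 2^{1-\beta} (\theta y)^\beta e^{-\theta y} + 2^{-\beta} (\theta y)^\beta e^{-\theta y/2} \right) \\
&\le 2^\beta E[Y^\beta] e^{\theta y} y^{-\beta} \left( 1 + 2e \sup_{t \ge 1} t^\beta e^{-t} + \sup_{t \ge 1/2} t^\beta e^{-t} \right).
\end{aligned}$$

Hence, we have shown that

$$2e \theta^\beta E[Y^\beta] + E[Y^\beta]\, \theta \int_{1/\theta}^y e^{\theta t} t^{-\beta}\, dt \le K E[Y^\beta] e^{\theta y} y^{-\beta},$$

where $K = 2^\beta \left( 1 + (2e + 1) \sup_{t \ge 1/2} t^\beta e^{-t} \right)$ does not depend on $\theta$ or the distribution of $Y$. Replacing the preceding inequality in (7.8) and using $1 + t \le e^t$ give

$$P\left( \sum_{i=1}^k Y_i^{(y)} > x \right) F(y)^k \le \begin{cases}
e^{-\theta(x - kE[Y]) + e k \theta^2 E[Y^2] + K k E[Y^\beta] e^{\theta y} y^{-\beta}}, & \beta > 2, \\
e^{-\theta(x - kE[Y]) + K k E[Y^\beta] e^{\theta y} y^{-\beta}}, & 1 < \beta \le 2, \\
e^{-\theta x + K k E[Y^\beta] e^{\theta y} y^{-\beta}}, & 0 < \beta \le 1.
\end{cases} \tag{7.9}$$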

Now, to complete the proof, we optimize the choice of 9 in the preceding bounds. For < (3 < 1, choose 
9 = y lo § (. KkElY^yi-e ) and notc that for a11 1 < fc < z' 3 /^' 3 ] and x > e^^'^'^ , 



Then, 



e 1 " — i* = e 

-(log 2) log 



(logx) 1 -! 3 



< e 

= (log x) (log log x) (l- (1 _'gf a 
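Now, for $\beta > 1$, set $\theta = \frac{1}{y} \log\left( \frac{(x - kE[Y])\, y^{\beta-1}}{K k E[Y^\beta]} \right)$ and note that for all $1 \le k \le (1 - \epsilon) x/(E[Y] \vee E[Y^\beta])$ and $x \ge e \vee (Ke/\epsilon)^{2/(\beta-1)}$,

$$\theta y \ge \log\left( \frac{\epsilon\, y^{\beta-1}}{K} \right) \ge \log\left( \frac{\epsilon\, x^{(\beta-1)/2}}{K} \right) \ge 1.$$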

Then, for 1< (3 < 2 and all 1 < k < (1 - e)x/(E[Y] V ^[y 3 ]), 

< e~ 

< e~ 

In addition, for (3 > 2 note that 



e08-l)(logx)»(l-i^-£!$^) 



supefc^F 2 ] < sup ^ ( log ( y — ) ) < sup ^ ~ ^ < e 5 (/3 - l) 2 . 

x>e x>eV \ \KJJ x > e X 

Finally, by combining the preceding two bounds with the first two inequalities in (|7.9p . we derive 
/ k \ 



yf>- x \\* e{p-lf{\ogx) A 



P V Yi > x, max Yi < y) < e 

\ ^ Ki<k } 



\i=l 



for any /3 > 1. □ 
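As an immediate corollary to the preceding lemma we obtain:

Corollary 7.1. Suppose that $Y_1, Y_2, \ldots$ are nonnegative iid random variables with the same distribution as $Y$, where $E[Y^\beta] < \infty$ for some $\beta > 0$. Then, for any $\kappa > 0$ there exists a constant $x_0 > 0$ that does not depend on the distribution of $Y$ such that

$$\sup_{1 \le k \le m_\beta(x)} P\left( \sum_{i=1}^k Y_i > x,\ \max_{1 \le i \le k} Y_i \le x/\log x \right) \le x^{-\kappa}$$

for all $x \ge x_0$, where

$$m_\beta(x) = \begin{cases}
\dfrac{x^\beta}{E[Y^\beta]}, & 0 < \beta \le 1, \\[2ex]
\dfrac{(1 - \epsilon)\, x}{E[Y] \vee E[Y^\beta]}, & \beta > 1,\ 0 < \epsilon < 1.
\end{cases}$$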

Lemma 7.4 below gives a bound for the distribution of $W_{n+1}$ in terms of that of $W_n$. This lemma can also be used to prove the corresponding uniform bound for $W_n$ in the case when $Q$ dominates the recursion. In the statement of the lemma we assume that $1/L(x)$ is locally bounded on $[1, \infty)$.

Lemma 7.4. Suppose that $P(N > x) \le x^{-\alpha} L(x)$, with $\alpha > 1$ and $L(\cdot)$ slowly varying, and $E[N] \max\{E[C^\alpha], E[C]\} < \eta < 1$. Then, for any $c > 0$, $0 < \epsilon < 1$, and $0 < \delta < 1 \wedge (\alpha - 1)/2$, there exist constants $K = K(\delta, \epsilon, c, \eta) > 0$ and $x_0 = x_0(\delta, \epsilon, c, \eta) > 0$, that do not depend on $n$, such that for all $1 \le n \le c \log x/|\log \eta|$ and all $x \ge x_0$,

$$P(W_{n+1} > x) \le K \eta^{(2 \wedge (\alpha - \delta)) n} x^{-\alpha} L(x) + E[N] P(C W_n > (1 - \epsilon) x),$$

where $C$ and $W_n$ are independent.
Remark: Note that the condition $E[N] \max\{E[C^\alpha], E[C]\} < 1$ is natural since it is needed for the finiteness of $E[R^\beta]$ for any $\beta < \alpha$. It is also in agreement with Lemma 5.1 in the sense that it is a necessary condition for the convergence (as $n \to \infty$) of the sum appearing in (5.1). The choice of $\eta$ is also suggested by the fact that for $\beta < \alpha$ one can obtain a weaker uniform bound by applying the moment estimate on $E[W_n^\beta]$ from Lemma 3.2, i.e., $P(W_n > x) \le E[W_n^\beta] x^{-\beta} \le K_\beta (E[N] \max\{E[C], E[C^\beta]\})^n x^{-\beta}$.

Before going into the proof, we would like to emphasize that special care goes into making sure that $K$ and $x_0$ in the statement of the lemma do not depend on $n$. This is important since Lemma 7.4 will be applied iteratively in the proof of Proposition 5.1, where one does not want $K$ and $x_0$ to grow from one iteration to the next.

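Proof of Lemma 7.4. By convexity of $f(\theta) = E[C^\theta]$, $\max\{E[C^\alpha], E[C]\} \ge \max\{E[C^{\alpha-\delta}], E[C]\}$, implying

$$\varepsilon \triangleq \frac{\eta}{E[N] \max\{E[C^{\alpha-\delta}], E[C]\}} - 1 > 0,$$

where the constant $\varepsilon$ is distinct from the parameter $\epsilon$ in the statement of the lemma.

Next, recall that $W_{n+1} = \sum_{i=1}^N C_i W_{n,i}$ where the $W_{n,i}$ are iid copies of $W_n$, let $Y = Y_i = C_i W_{n,i}$ and $\beta = \alpha - \delta > 1$. Note that by Lemma 3.2 there exists a constant $K_1 > 0$ (that does not depend on $n$) such that

$$\begin{aligned}
E[Y^\beta] &= E[C^\beta] E[W_n^\beta] \\
&\le K_1 (E[N] \max\{E[C^{\alpha-\delta}], E[C]\})^n \\
&= K_1 (1 + \varepsilon)^{-n} \eta^n,
\end{aligned} \tag{7.10}$$

where the last equality comes from the definition of $\varepsilon$. And since $E[Y] = E[C] E[Q] (E[N] E[C])^n \le E[C] E[Q] (E[N] \max\{E[C^{\alpha-\delta}], E[C]\})^n$, then

$$E[Y^\beta] \vee E[Y] \le K_2 (1 + \varepsilon)^{-n} \eta^n \tag{7.11}$$

for some constant $K_2 > 0$ that does not depend on $n$. With the intent of applying Corollary 7.1 we define

$$y = \epsilon x \qquad \text{and} \qquad m_\beta(x) = \left\lfloor \epsilon^2 x/(E[Y^\beta] \vee E[Y]) \right\rfloor.$$

Let $M_N^{(i)}$ be the $i$th order statistic of $\{Y_1, \ldots, Y_N\}$, with $M_N^{(N)}$ being the largest. Then,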



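$$\begin{aligned}
P(W_{n+1} > x) &\le P\left( \sum_{i=1}^N Y_i > x,\ N \le m_\beta(x) \right) + P(N > m_\beta(x)) \\
&\le P\left( \sum_{i=1}^N Y_i > x,\ M_N^{(N)} \le (1 - \epsilon) x,\ N \le m_\beta(x) \right) \\
&\qquad + P\left( M_N^{(N)} > (1 - \epsilon) x,\ N \le m_\beta(x) \right) + P(N > m_\beta(x)) \\
&\le P\left( \sum_{i=1}^N Y_i > x,\ M_N^{(N)} \le (1 - \epsilon) x,\ M_N^{(N-1)} \le y/\log y,\ N \le m_\beta(x) \right) && (7.12) \\
&\qquad + P\left( M_N^{(N-1)} > y/\log y,\ N \le m_\beta(x) \right) && (7.13) \\
&\qquad + P\left( M_N^{(N)} > (1 - \epsilon) x,\ N \le m_\beta(x) \right) + P(N > m_\beta(x)). && (7.14)
\end{aligned}$$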
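Note that the term in (7.12) can be bounded as follows

$$\begin{aligned}
&P\left( \sum_{i=1}^N Y_i > x,\ M_N^{(N)} \le (1 - \epsilon) x,\ M_N^{(N-1)} \le y/\log y,\ N \le m_\beta(x) \right) \\
&\qquad \le P\left( \sum_{i=1}^N Y_i - M_N^{(N)} > y,\ M_N^{(N-1)} \le y/\log y,\ N \le m_\beta(x) \right) \\
&\qquad \le P\left( \sum_{i=1}^{m_\beta(x)} Y_i > y,\ \max_{1 \le i \le m_\beta(x)} Y_i \le y/\log y \right).
\end{aligned}$$

Fix $\nu = \alpha + \delta + c(\alpha - \delta)$, then, by Corollary 7.1 there exists a constant $x_1 \ge e$, that does not depend on the distribution of $Y$ (and therefore, does not depend on $n$), such that

$$\begin{aligned}
P\left( \sum_{i=1}^{m_\beta(x)} Y_i > y,\ \max_{1 \le i \le m_\beta(x)} Y_i \le y/\log y \right) &\le y^{-\nu} = \epsilon^{-\nu} \eta^{c(\alpha-\delta) \log x/|\log \eta|} x^{-\alpha-\delta} \\
&\le \epsilon^{-\nu} \eta^{(\alpha-\delta) n} x^{-\alpha-\delta} = \epsilon^{-\nu}\, \frac{x^{-\delta}}{L(x)}\, \eta^{\beta n} x^{-\alpha} L(x) \\
&\le \epsilon^{-\nu} \sup_{t \ge 1} \frac{t^{-\delta}}{L(t)}\, \eta^{\beta n} x^{-\alpha} L(x)
\end{aligned}$$

for all $y \ge x_1$, where the second inequality follows from the assumption $n \le c \log x/|\log \eta|$, and in the second equality we use the definition $\beta = \alpha - \delta$. To bound (7.13), we condition on $N$,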

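$$\begin{aligned}
P\left( M_N^{(N-1)} > y/\log y,\ N \le m_\beta(x) \right) &= \sum_{k=1}^{m_\beta(x)} P\left( M_k^{(k-1)} > y/\log y \right) P(N = k) \\
&\le \sum_{k=1}^{m_\beta(x)} \binom{k}{2} P(Y > y/\log y)^2 P(N = k) \\
&\le E\left[ N^2 1(N \le m_\beta(x)) \right] P(Y > y/\log y)^2 \\
&\le E\left[ N^{2 \wedge \beta} \right] m_\beta(x)^{(2-\beta)^+} P(Y > y/\log y)^2,
\end{aligned}$$

where in the last inequality we use $N \le m_\beta(x)$ in case $N$ does not have a second moment. Now, by Markov's inequality and the definition of $m_\beta(x)$,

$$\begin{aligned}
m_\beta(x)^{(2-\beta)^+} P(Y > y/\log y)^2 &\le m_\beta(x)^{(2-\beta)^+} \left( \frac{E[Y^\beta] (\log y)^\beta}{y^\beta} \right)^2 \\
&\le \left( \frac{\epsilon\, y}{E[Y^\beta] \vee E[Y]} \right)^{(2-\beta)^+} \frac{E[Y^\beta]^2 (\log y)^{2\beta}}{y^{2\beta}} \\
&\le \frac{\epsilon^{(2-\beta)^+} E[Y^\beta]^{2 \wedge \beta} (\log y)^{2\beta}}{y^{2\beta \wedge (3\beta - 2)}} \\
&\le \frac{\epsilon^{(2-\beta)^+} \left( K_1 (1 + \varepsilon)^{-n} \eta^n \right)^{2 \wedge \beta} (\log y)^{2\beta}}{y^{2\beta \wedge (3\beta - 2)}}.
\end{aligned}$$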
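Our choice of $\delta$ guarantees that $2\beta \wedge (3\beta - 2) > \alpha + \delta$ and $\beta = \alpha - \delta > 1$, and therefore,

$$\begin{aligned}
P\left( M_N^{(N-1)} > y/\log y,\ N \le m_\beta(x) \right) &\le K_3\, \frac{\eta^{(2 \wedge \beta) n}}{x^{\alpha+\delta}} \\
&= K_3\, \frac{x^{-\delta}}{L(x)}\, \eta^{(2 \wedge \beta) n} x^{-\alpha} L(x) \\
&\le K_3 \sup_{t \ge 1} \frac{t^{-\delta}}{L(t)}\, \eta^{(2 \wedge \beta) n} x^{-\alpha} L(x)
\end{aligned}$$

for all $x \ge x_2 \triangleq \epsilon^{-1} e$, where

$$K_3 = K_3(\epsilon, \delta) = E\left[ N^{2 \wedge \beta} \right] \epsilon^{(2-\beta)^+ - \alpha - \delta} K_1^{2 \wedge \beta} \sup_{t \ge e} \frac{(\log t)^{2\beta}}{t^{2\beta \wedge (3\beta - 2) - \alpha - \delta}}.$$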
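The claim that the choice of $\delta$ leaves a strictly positive power of $y$ to absorb the logarithmic factor can be checked on a grid. The sketch below assumes only the constraints stated in the lemma, $\alpha > 1$ and $0 < \delta < 1 \wedge (\alpha-1)/2$, with $\beta = \alpha - \delta$; the function name and the grid are illustrative.

```python
def exponent_gap(alpha: float, delta: float) -> float:
    """Return min(2*beta, 3*beta - 2) - (alpha + delta) with beta = alpha - delta."""
    beta = alpha - delta
    return min(2.0 * beta, 3.0 * beta - 2.0) - (alpha + delta)


if __name__ == "__main__":
    for alpha in (1.1, 1.5, 2.0, 3.0, 5.0, 10.0):
        delta_max = min(1.0, (alpha - 1.0) / 2.0)
        for frac in (0.01, 0.5, 0.99):
            delta = frac * delta_max
            print(alpha, delta, alpha - delta, exponent_gap(alpha, delta))
```

A positive gap is what makes the supremum in the definition of $K_3$ finite.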

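To bound the second term in (7.14), we first note that by Potter's Theorem (see Theorem 1.5.6 (ii) on p. 25 in [8]), there exists a constant $x_3 = x_3(\epsilon, \delta)$ such that for all $x \ge x_3$

$$\begin{aligned}
P(N > m_\beta(x)) &\le \frac{(m_\beta(x))^{-\alpha} L(m_\beta(x))}{x^{-\alpha} L(x)} \cdot x^{-\alpha} L(x) \\
&\le (1 + \epsilon) \max\left\{ \left( \frac{m_\beta(x)}{x} \right)^{-\alpha+\delta}, \left( \frac{m_\beta(x)}{x} \right)^{-\alpha-\delta} \right\} x^{-\alpha} L(x)
\end{aligned}$$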

^(l +£ )max{(^™)^ (Z^^Yy-m 
<^^(E[Y^VE[Y}fx- a L(x) 

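$$\le K_4\, \eta^{\beta n} x^{-\alpha} L(x).$$

Finally, for the first term in (7.14),

$$\begin{aligned}
P\left( M_N^{(N)} > (1 - \epsilon) x,\ N \le m_\beta(x) \right) &\le P\left( M_N^{(N)} > (1 - \epsilon) x \right) \\
&\le E[N] P(Y > (1 - \epsilon) x).
\end{aligned}$$

Combining the preceding bounds for (7.12) - (7.14) and setting $x_0 = \max\{x_1, x_2, x_3\}$ and $K = (\epsilon^{-\nu} + K_3) \sup_{t \ge 1} \frac{t^{-\delta}}{L(t)} + K_4$ completes the proof. □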

Finally, we give the proof of Proposition 5.1, the main technical contribution of Section 5.

Proof of Proposition 5.1. Note that it is enough to prove the proposition for all $x \ge x_0$ for some $x_0 = x_0(\eta, \nu) > 1$, since for all $1 \le x \le x_0$ and $n \ge 1$,

$$P(W_n > x) \le E[Q] (E[N] E[C])^n x^{-1} \le \frac{E[Q]}{P(N > x_0)}\, \eta^n P(N > x) \qquad \text{(by Markov's inequality)}.$$
Next, choose $0 < \epsilon < 1$ such that

$$E[N] E[C^\alpha] \left( (1 - \epsilon)^{-\alpha-1} + 2\epsilon \right) \le \eta, \tag{7.15}$$

define $c = \nu/2$,

$$\gamma = \frac{1}{|\log \eta|} \log\left( \frac{\eta}{E[N] \max\{E[C^\alpha], E[C]\}} \right),$$

and select $0 < \delta < \min\{1, (\alpha - 1)/2, c\gamma\}$. Now, by Lemma 7.4 there exist constants $K_1, x_1 > 0$ (that do not depend on $n$) such that

$$P(W_{n+1} > x) \le K_1 \eta^{(2 \wedge (\alpha - \delta)) n} P(N > x) + E[N] P(C W_n > (1 - \epsilon) x)$$

for all $x \ge x_1$. Hence, by defining $n_0 = (2 \wedge (\alpha - \delta) - 1)^{-1} (\log \eta)^{-1} \log(\epsilon E[N] E[C^\alpha])$, we obtain

$$P(W_{n+1} > x) \le K_1 E[N] E[C^\alpha]\, \epsilon\, \eta^n P(N > x) + E[N] P(C W_n > (1 - \epsilon) x) \tag{7.16}$$

for all $n \ge n_0$ and all $x \ge x_1$.
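The role of the threshold $n_0$ can be illustrated numerically. The sketch below assumes $n_0 = \log(\epsilon \rho_\alpha) / ((2 \wedge \beta - 1) \log \eta)$ with $\rho_\alpha = E[N]E[C^\alpha]$ and $\beta = \alpha - \delta$, and checks that $\eta^{(2 \wedge \beta) n} \le \epsilon \rho_\alpha \eta^n$ exactly when $n \ge n_0$. The function names and the parameter values are hypothetical.

```python
import math


def n0_threshold(eta: float, beta: float, eps: float, rho_alpha: float) -> float:
    """n_0 = log(eps * rho_alpha) / ((min(2, beta) - 1) * log(eta)), for beta > 1 and 0 < eta < 1."""
    return math.log(eps * rho_alpha) / ((min(2.0, beta) - 1.0) * math.log(eta))


def geometric_rate_holds(n: int, eta: float, beta: float, eps: float, rho_alpha: float) -> bool:
    """Check eta^{(2 ^ beta) n} <= eps * rho_alpha * eta^n."""
    return eta ** (min(2.0, beta) * n) <= eps * rho_alpha * eta ** n


if __name__ == "__main__":
    eta, beta, eps, rho_alpha = 0.8, 1.6, 0.1, 0.6  # illustrative values
    n0 = n0_threshold(eta, beta, eps, rho_alpha)
    print("n0 =", n0)
    for n in (math.floor(n0), math.ceil(n0), math.ceil(n0) + 10):
        print(n, geometric_rate_holds(n, eta, beta, eps, rho_alpha))
```

With these values $n_0 \approx 21.01$, so the inequality fails at $n = 21$ and holds from $n = 22$ onwards.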

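Next, in order to derive an explicit bound for $P(W_n > x)$, we need the following two estimates (7.17) and (7.18). In this regard, choose $x_0 \ge 1 \vee x_1$ such that

$$P(CN > (1 - \epsilon) x) \le E[C^\alpha] (1 - \epsilon)^{-\alpha-1} P(N > x) \tag{7.17}$$

for all $x \ge x_0$. This is possible since by Lemma 4.2 from [21] $P(CN > (1 - \epsilon) x) \sim E[C^\alpha] (1 - \epsilon)^{-\alpha} P(N > x)$. Also, by Markov's inequality, we have that for all $1 \le n \le c \log x/|\log \eta|$,

$$\begin{aligned}
P(C > (1 - \epsilon) x/x_0) &\le E[C^{\alpha+\nu}] (1 - \epsilon)^{-\alpha-\nu} x_0^{\alpha+\nu} x^{-\alpha-\nu} \\
&= \frac{E[C^{\alpha+\nu}]\, x_0^{\alpha+\nu}}{(1 - \epsilon)^{\alpha+\nu} x^{\nu/2} L(x)}\, x^{-\nu/2} P(N > x) \\
&\le \frac{E[C^{\alpha+\nu}]\, x_0^{\alpha+\nu}}{(1 - \epsilon)^{\alpha+\nu} x^{\nu/2} L(x)}\, \eta^n P(N > x),
\end{aligned} \tag{7.18}$$

where in the second inequality we use $x^{-\nu/2} = x^{-c} = \eta^{c \log x/|\log \eta|} \le \eta^n$. Now, define

$$K_2 = \max\left\{ 1,\ K_1,\ \sup_{x \ge x_0} \frac{E[C^{\alpha+\nu}]\, x_0^{\alpha+\nu}}{\epsilon E[C^\alpha] (1 - \epsilon)^{\alpha+\nu} x^{\nu/2} L(x)} \right\}.$$

Now we proceed to derive bounds for $P(W_n > x)$ for different ranges of $n$. For all $1 \le n \le n_0$ and all $x \ge x_0$, by Lemma 5.1 there exists a constant $K_0 \ge K_2$ such that

$$P(W_n > x) \le K_0 \eta^n P(N > x). \tag{7.19}$$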

Next, for the values $n_0 \le n \le c \log x/|\log \eta|$ we proceed by induction using (7.16). To this end, suppose (7.19) holds for some $n$ in the specified range. Then, note that by (7.18) and the induction hypothesis (7.19), we have for all $x \ge x_0$,

$$\begin{aligned}
P(C W_n > (1 - \epsilon) x) &\le P(C W_n > (1 - \epsilon) x,\ C \le (1 - \epsilon) x/x_0) + P(C > (1 - \epsilon) x/x_0) \\
&\le \int_0^{(1-\epsilon) x/x_0} P(W_n > (1 - \epsilon) x/y)\, P(C \in dy) + K_2 E[C^\alpha]\, \epsilon\, \eta^n P(N > x) \\
&\le K_0 \eta^n \int_0^\infty P(N > (1 - \epsilon) x/y)\, P(C \in dy) + K_2 E[C^\alpha]\, \epsilon\, \eta^n P(N > x) \\
&= K_0 \eta^n P(CN > (1 - \epsilon) x) + K_2 E[C^\alpha]\, \epsilon\, \eta^n P(N > x) \\
&\le K_0 E[C^\alpha] \left( (1 - \epsilon)^{-\alpha-1} + \epsilon \right) \eta^n P(N > x),
\end{aligned}$$

where in the last inequality we used (7.17) and $K_0 \ge K_2$. Then, by replacing the preceding bound in (7.16) and using (7.15), we derive

$$\begin{aligned}
P(W_{n+1} > x) &\le K_0 E[N] E[C^\alpha] \left( (1 - \epsilon)^{-\alpha-1} + 2\epsilon \right) \eta^n P(N > x) \\
&\le K_0 \eta^{n+1} P(N > x)
\end{aligned}$$

for all $x \ge x_0$ and all $1 \le n \le c \log x/|\log \eta|$.

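Finally, for $n \ge c \log x/|\log \eta|$, we follow a different approach that comes from our moment estimates for $W_n$. Let

$$\varepsilon \triangleq \frac{\eta}{E[N] \max\{E[C^\alpha], E[C]\}} - 1 > 0$$

and note that by convexity

$$E[N] \max\{E[C^{\alpha-\delta}], E[C]\} \le E[N] \max\{E[C^\alpha], E[C]\} = (1 + \varepsilon)^{-1} \eta.$$

Then, by Markov's inequality and Lemma 3.2 we have

$$\begin{aligned}
P(W_n > x) &\le E[W_n^{\alpha-\delta}]\, x^{-\alpha+\delta} \\
&\le K_{\alpha-\delta} (E[N] \max\{E[C^{\alpha-\delta}], E[C]\})^n x^{-\alpha+\delta} \\
&\le K_{\alpha-\delta} (1 + \varepsilon)^{-n} \eta^n x^{-\alpha+\delta} \\
&\le K_{\alpha-\delta}\, x^{-\log(1+\varepsilon) c/|\log \eta|}\, \eta^n x^{-\alpha+\delta}
\end{aligned} \tag{7.20}$$

for all $x > 0$. Note that the preceding bound,

$$\frac{\log(1 + \varepsilon)}{|\log \eta|} = \frac{1}{|\log \eta|} \log\left( \frac{\eta}{E[N] \max\{E[C^\alpha], E[C]\}} \right) = \gamma$$

and (7.20) yield

$$\begin{aligned}
P(W_n > x) &\le K_{\alpha-\delta}\, \eta^n x^{-c\gamma - \alpha + \delta} \\
&= K_{\alpha-\delta}\, \eta^n\, \frac{x^{\delta - c\gamma}}{L(x)}\, P(N > x) \\
&\le K_{\alpha-\delta} \sup_{t \ge 1} \frac{t^{\delta - c\gamma}}{L(t)}\, \eta^n P(N > x)
\end{aligned}$$

for all $x \ge 1$; recall that $\delta < c\gamma$. Thus, setting $K = \max\left\{ K_0,\ K_{\alpha-\delta} \sup_{t \ge 1} t^{\delta - c\gamma} (L(t))^{-1} \right\}$ completes the proof. □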

7.4. The case when Q dominates

We end the paper with the proof of Lemma 6.1 (the proof of Lemma 6.2 is basically the same) and a sketch of the proof of Proposition 6.1. As mentioned before, the proofs of the other results presented in Section 6 have been omitted since they are very similar to those from Section 5.



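Proof of Lemma 6.1. We proceed by induction in $n$. By Lemma 4.2 from [21]

$$P(CQ > x) \sim E[C^\alpha] P(Q > x),$$

by Lemma 3.7(1) from the same source,

$$P\left( \sum_{i=1}^{N^{(0)}} C_i^{(1)} Q_i^{(1)} > x \right) \sim E[N] P(CQ > x) \sim E[N] E[C^\alpha] P(Q > x),$$

and by Lemma 3.1, again from the same source, we have

$$\begin{aligned}
P(R^{(1)} > x) &\sim P\left( \sum_{i=1}^{N^{(0)}} C_i^{(1)} Q_i^{(1)} > x \right) + P(Q > x) \\
&\sim (\rho_\alpha + 1) P(Q > x).
\end{aligned}$$

Now suppose that we have

$$P(R^{(n)} > x) \sim \sum_{k=0}^n \rho_\alpha^k\, P(Q > x).$$

Then,

$$\begin{aligned}
P(R^{(n+1)} > x) &= P\left( \sum_{i=1}^{N^{(0)}} C_i^{(1)} R_i^{(n)} + Q^{(0)} > x \right) \\
&\sim P\left( \sum_{i=1}^{N^{(0)}} C_i^{(1)} R_i^{(n)} > x \right) + P(Q > x) \\
&\sim E[N] E[C^\alpha] P(R^{(n)} > x) + P(Q > x) \\
&\sim \left( \rho_\alpha \sum_{k=0}^n \rho_\alpha^k + 1 \right) P(Q > x) \\
&= \sum_{k=0}^{n+1} \rho_\alpha^k\, P(Q > x).
\end{aligned}$$

□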



Sketch of the proof of Proposition 6.1. By Markov's inequality

$$P(N > x) \le E[N^{\alpha+\nu}]\, x^{-\alpha-\nu}$$

for all $x > 0$. Use Lemma 7.4 to obtain

$$P(W_{n+1} > x) \le K_1 E[N] E[C^\alpha]\, \epsilon\, \eta^n P(Q > x) + E[N] P(C W_n > (1 - \epsilon) x)$$

for all $n_0 \le n \le \kappa \log x$ and all $x \ge x_1$ (for suitably chosen constants $\epsilon$, $n_0$, $\kappa$). Choose $x_0 \ge 1 \vee x_1$ such that

$$P(CQ > (1 - \epsilon) x) \le E[C^\alpha] (1 - \epsilon)^{-\alpha-1} P(Q > x).$$

The rest of the proof continues as in Proposition 5.1 with some modifications. □